Search results for "Sequence analysis"
showing 10 items of 1349 documents
Identification and visualization of differential isoform expression in RNA-seq time series
2018
Abstract Motivation As sequencing technologies improve their capacity to detect distinct transcripts of the same gene and to address complex experimental designs such as longitudinal studies, there is a need to develop statistical methods for the analysis of isoform expression changes in time series data. Results Iso-maSigPro is a new functionality of the R package maSigPro for transcriptomics time series data analysis. Iso-maSigPro identifies genes with a differential isoform usage across time. The package also includes new clustering and visualization functions that allow grouping of genes with similar expression patterns at the isoform level, as well as those genes with a shift in major …
panISa: ab initio detection of insertion sequences in bacterial genomes from short read sequence data.
2018
Abstract Motivation The advent of next-generation sequencing has boosted the analysis of bacterial genome evolution. Insertion sequence (IS) elements play a key role in prokaryotic genome organization and evolution, but their repetitions in genomes complicate their detection from short-read data. Results PanISa is a software pipeline that identifies IS insertions ab initio in bacterial genomes from short-read data. It is a highly sensitive and precise tool based on the detection of read-mapping patterns at the insertion site. PanISa performs better than existing IS detection systems as it is based on a database-free approach. We applied it to a high-risk clone lineage of the pathogenic spec…
dAPE: a web server to detect homorepeats and follow their evolution.
2016
Abstract Summary Homorepeats are low complexity regions consisting of repetitions of a single amino acid residue. There is no current consensus on the minimum number of residues needed to define a functional homorepeat, nor even if mismatches are allowed. Here we present dAPE, a web server that helps following the evolution of homorepeats based on orthology information, using a sensitive but tunable cutoff to help in the identification of emerging homorepeats. Availability and Implementation dAPE can be accessed from http://cbdm-01.zdv.uni-mainz.de/∼munoz/polyx. Supplementary information Supplementary data are available at Bioinformatics online.
AFS: identification and quantification of species composition by metagenomic sequencing
2017
Abstract Summary DNA-based methods to detect and quantify taxon composition in biological materials are often based on species-specific polymerase chain reaction, limited to detecting species targeted by the assay. Next-generation sequencing overcomes this drawback by untargeted shotgun sequencing of whole metagenomes at affordable cost. Here we present AFS, a software pipeline for quantification of species composition in food. AFS uses metagenomic shotgun sequencing and sequence read counting to infer species proportions. Using Illumina data from a reference sausage comprising four species, we reveal that AFS is independent of the sequencing assay and library preparation protocol. Cost-sav…
In vitro versus in vivo compositional landscapes of histone sequence preferences in eucaryotic genomes
2018
Abstract Motivation Although the nucleosome occupancy along a genome can be in part predicted by in vitro experiments, it has been recently observed that the chromatin organization presents important differences in vitro with respect to in vivo. Such differences mainly regard the hierarchical and regular structures of the nucleosome fiber, whose existence has long been assumed, and in part also observed in vitro, but that does not apparently occur in vivo. It is also well known that the DNA sequence has a role in determining the nucleosome occupancy. Therefore, an important issue is to understand if, and to what extent, the structural differences in the chromatin organization between in vit…
A Clonal Lineage of Fusarium oxysporum Circulates in the Tap Water of Different French Hospitals.
2016
ABSTRACT Fusarium oxysporum is typically a soilborne fungus but can also be found in aquatic environments. In hospitals, water distribution systems may be reservoirs for the fungi responsible for nosocomial infections. F. oxysporum was previously detected in the water distribution systems of five French hospitals. Sixty-eight isolates from water representative of all hospital units that were previously sampled and characterized by translation elongation factor 1α sequence typing were subjected to microsatellite analysis and full-length ribosomal intergenic spacer (IGS) sequence typing. All but three isolates shared common microsatellite loci and a common two-locus sequence type (ST). This S…
Inhabiting plant roots, nematodes, and truffles—polyphilus, a new helotialean genus with two globally distributed species
2018
Fungal root endophytes, including the common group of dark septate endophytes (DSEs), represent different taxonomic groups and potentially diverse life strategies. In this study, we investigated two unidentified helotialean lineages found previously in a study of DSE fungi of semiarid grasslands, from several other sites, and collected recently from a pezizalean truffle ascoma and eggs of the cereal cyst nematode Heterodera filipjevi. The taxonomic positions and phylogenetic relationships of 21 isolates with different hosts and geographic origins were studied in detail. Four loci, namely, nuc rDNA ITS1-5.8S-ITS2 (internal transcribed spacer [ITS]), partial 28S nuc rDNA (28S), partial 18S nu…
Evolutionary History of the Nesophontidae, the Last Unplaced Recent Mammal Family
2016
The mammalian evolutionary tree has lost several major clades through recent human-caused extinctions. This process of historical biodiversity loss has particularly affected tropical island regions such as the Caribbean, an area of great evolutionary diversification but poor molecular preservation. The most enigmatic of the recently extinct endemic Caribbean mammals are the Nesophontidae, a family of morphologically plesiomorphic lipotyphlan insectivores with no consensus on their evolutionary affinities, and which constitute the only major recent mammal clade to lack any molecular information on their phylogenetic placement. Here, we use a palaeogenomic approach to place Nesophontidae with…
Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data
2016
Next-generation sequencing technologies have led to the sequencing of more and more genomes, propelling related research into the era of big data. In this paper, we present ParaBWT, a parallelized Burrows-Wheeler transform (BWT) and suffix array construction algorithm for big genome data. In ParaBWT, we have investigated a progressive construction approach to constructing the BWT of single genome sequences in linear space complexity, but with a small constant factor. This approach has been further parallelized using multi-threading based on a master-slave coprocessing model. After gaining the BWT, the suffix array is constructed in a memory-efficient manner. The performance of ParaBWT has b…
Accelerating metagenomic read classification on CUDA-enabled GPUs.
2016
Metagenomic sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification; i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes software tools for fast and accurate metagenomic read classification are urgently needed. We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (…