Search results for "Sequencing"
showing 10 items of 1087 documents
unitas: the universal tool for annotation of small RNAs
2017
AbstractBackgroundNext generation sequencing is a key technique in small RNA biology research that has led to the discovery of functionally different classes of small non-coding RNAs in the past years. However, reliable annotation of the extensive amounts of small non-coding RNA data produced by high-throughput sequencing is time-consuming and requires robust bioinformatics expertise. Moreover, existing tools have a number of shortcomings including a lack of sensitivity under certain conditions, limited number of supported species or detectable sub-classes of small RNAs.ResultsHere we introduce unitas, an out-of-the-box ready software for complete annotation of small RNA sequence datasets, …
Faster GPU-Accelerated Smith-Waterman Algorithm with Alignment Backtracking for Short DNA Sequences
2014
In this paper, we present a GPU-accelerated Smith-Waterman (SW) algorithm with Alignment Backtracking, called GSWAB, for short DNA sequences. This algorithm performs all-to-all pairwise alignments and retrieves optimal local alignments on CUDA-enabled GPUs. To facilitate fast alignment backtracking, we have investigated a tile-based SW implementation using the CUDA programming model. This tiled computing pattern enables us to more deeply explore the powerful compute capability of GPUs. We have evaluated the performance of GSWAB on a Kepler-based GeForce GTX Titan graphics card. The results show that GSWAB can achieve a performance of up to 56.8 GCUPS on large-scale datasets. Furthermore, ou…
GSWABE: faster GPU-accelerated sequence alignment with optimal alignment retrieval for short DNA sequences
2014
In this paper, we present GSWABE, a graphics processing unit GPU-accelerated pairwise sequence alignment algorithm for a collection of short DNA sequences. This algorithm supports all-to-all pairwise global, semi-global and local alignment, and retrieves optimal alignments on Compute Unified Device Architecture CUDA-enabled GPUs. All of the three alignment types are based on dynamic programming and share almost the same computational pattern. Thus, we have investigated a general tile-based approach to facilitating fast alignment by deeply exploring the powerful compute capability of CUDA-enabled GPUs. The performance of GSWABE has been evaluated on a Kepler-based Tesla K40 GPU using a varie…
The molecular profiling of solid tumors by liquid biopsy: a position paper of the AIOM–SIAPEC-IAP–SIBioC–SIC–SIF Italian Scientific Societies
2021
The term liquid biopsy (LB) refers to the use of various biological fluids as a surrogate for neoplastic tissue to achieve information for diagnostic, prognostic and predictive purposes. In the current clinical practice, LB is used for the identification of driver mutations in circulating tumor DNA derived from both tumor tissue and circulating neoplastic cells. As suggested by a growing body of evidence, however, there are several clinical settings where biological samples other than tissue could be used in the routine practice to identify potentially predictive biomarkers of either response or resistance to targeted treatments. New applications are emerging as useful clinical tools, and o…
Polynucleotide differentiation using hybrid solid-state nanopore functionalizing with α-hemolysin
2019
We report results from full atomistic molecular dynamics simulations on the properties of biomimetic nanopores. This latter result was obtained through the direct insertion of an α-hemolysin protein inside a hydrophobic solid-state nanopore. Upon translocation of different DNA strands, we demonstrate here that the theoretical system presents the same discrimination properties as the experimental one obtained previously. This opens an interesting way to promote the stability of a specific protein inside a solid nanopore to develop further biomimetic applications for DNA or protein sequencing.
Characterization of heat-labile toxin-subunit B from Escherichia coli by liquid chromatography-electrospray ionization-mass spectrometry and matrix-a…
2012
The possibilities of characterizing the heat-labile enterotoxin (LT) of enterotoxigenic Escherichia coli (ETEC) by liquid chromatography electrospray mass spectrometry (LC/ESI-MS) and matrix-assisted laser desorption with time-of-flight mass spectrometry (MALDI-TOF-MS) were investigated. The B subunit from recombinant E. coli (expression in Pichia pastoris) can be detected by LC/ESI-MS expressed in P. pastoris and the charge envelope signals can be observed; LC/ESI-MS and MALDI-TOF-MS analysis allowed the acquisition of labile toxin subunit B (LTB) molecular weight and preliminary structural characterization of LTB toxin. MALDI-TOF analysis after reduction and alkylation of the protein evid…
Alignment-Free Sequence Comparison over Hadoop for Computational Biology
2015
Sequence comparison i.e., The assessment of how similar two biological sequences are to each other, is a fundamental and routine task in Computational Biology and Bioinformatics. Classically, alignment methods are the de facto standard for such an assessment. In fact, considerable research efforts for the development of efficient algorithms, both on classic and parallel architectures, has been carried out in the past 50 years. Due to the growing amount of sequence data being produced, a new class of methods has emerged: Alignment-free methods. Research in this ares has become very intense in the past few years, stimulated by the advent of Next Generation Sequencing technologies, since those…
CROSSMAPPER: estimating cross-mapping rates and optimizing experimental design in multi-species sequencing studies
2020
Motivation Numerous sequencing studies, including transcriptomics of host-pathogen systems, sequencing of hybrid genomes, xenografts, mixed species systems, metagenomics and meta-transcriptomics, involve samples containing genetic material from divergent organisms. A crucial step in these studies is identifying from which organism each sequencing read originated, and the experimental design should be directed to minimize biases caused by cross-mapping of reads to incorrect source genomes. Additionally, pooling of sufficiently different genetic material into a single sequencing library could significantly reduce experimental costs but requires careful planning and assessment of the impact of…
Cluster-Localized Sparse Logistic Regression for SNP Data
2012
The task of analyzing high-dimensional single nucleotide polymorphism (SNP) data in a case-control design using multivariable techniques has only recently been tackled. While many available approaches investigate only main effects in a high-dimensional setting, we propose a more flexible technique, cluster-localized regression (CLR), based on localized logistic regression models, that allows different SNPs to have an effect for different groups of individuals. Separate multivariable regression models are fitted for the different groups of individuals by incorporating weights into componentwise boosting, which provides simultaneous variable selection, hence sparse fits. For model fitting, th…
Acceleration of short and long DNA read mapping without loss of accuracy using suffix array
2014
HPG Aligner applies suffix arrays for DNA read mapping. This implementation produces a highly sensitive and extremely fast mapping of DNA reads that scales up almost linearly with read length. The approach presented here is faster (over 20 for long reads) and more sensitive (over 98% in a wide range of read lengths) than the current state-of-the-art mappers. HPG Aligner is not only an optimal alternative for current sequencers but also the only solution available to cope with longer reads and growing throughputs produced by forthcoming sequencing technologies.