Search results for "High-throughput"
showing 10 items of 292 documents
RabbitQC: high-speed scalable quality control for sequencing data
2019
Abstract Motivation Modern sequencing technologies continue to revolutionize many areas of biology and medicine. Since the generated datasets are error-prone, downstream applications usually require quality control methods to pre-process FASTQ files. However, existing tools for this task are currently not able to fully exploit the capabilities of computing platforms leading to slow runtimes. Results We present RabbitQC, an extremely fast integrated quality control tool for FASTQ files, which can take full advantage of modern hardware. It includes a variety of operations and supports different sequencing technologies (Illumina, Oxford Nanopore and PacBio). RabbitQC achieves speedups between …
An unusually high substitution rate in transplant-associated BK polyomavirus in vivo is further concentrated in HLA-C-bound viral peptides
2018
Infection with human BK polyomavirus, a small double-stranded DNA virus, potentially results in severe complications in immunocompromised patients. Here, we describe the in vivo variability and evolution of the BK polyomavirus by deep sequencing. Our data reveal the highest genomic evolutionary rate described in double-stranded DNA viruses, i.e., 10⁻³–10⁻⁵ substitutions per nucleotide site per year. High mutation rates in viruses allow their escape from immune surveillance and adaptation to new hosts. By combining mutational landscapes across viral genomes with in silico prediction of viral peptides, we demonstrate the presence of significantly more coding substitutions within predicted cog…
Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates
2021
Because of the wide spread of the COVID-19 pandemic, the SARS-CoV-2 genome is evolving in diverse human populations. Several studies have already reported different strains and an increase in the mutation rate. In particular, mutations in the SARS-CoV-2 spike glycoprotein are of great interest because it mediates infection in humans, and recently approved mRNA vaccines are designed to induce immune responses against it. We analyzed 1,036,030 SARS-CoV-2 genome assemblies and 30,806 NGS datasets from GISAID and the European Nucleotide Archive (ENA), focusing on non-synonymous mutations in the spike protein. Only around 2.5% of the samples contained the wild-type spike protein with no variation from the reference. Among…
Two distinct extracellular RNA signatures released by a single cell type identified by microarray and next-generation sequencing
2016
ABSTRACT Cells secrete extracellular RNA (exRNA) to their surrounding environment and exRNA has been found in many body fluids such as blood, breast milk and cerebrospinal fluid. However, there are conflicting results regarding the nature of exRNA. Here, we have separated 2 distinct exRNA profiles released by mast cells, here termed high-density (HD) and low-density (LD) exRNA. The exRNA in both fractions was characterized by microarray and next-generation sequencing. Both exRNA fractions contained mRNA and miRNA, and the mRNAs in the LD exRNA correlated closely with the cellular mRNA, whereas the HD mRNA did not. Furthermore, the HD exRNA was enriched in lincRNA, antisense RNA, vault RNA, …
Large scale preparation of human MHC class II+ integrin beta(1)+ Tregs.
2010
Abstract The human CD4+CD25+FoxP3+ regulatory T cell population (Tregs) contains both MHC class II+ and MHC class II− cells. MHC class II+ Tregs belong to the integrin α4β1+ subpopulation and exclusively execute contact-dependent suppressive activity. Here we present a method optimized for isolation of these MHC class II expressing Tregs from large leukaphereses products using magnetic microbeads that achieves a reproducible purity of more than 90% and enables the use of this small-sized Treg population in pre-clinical application and basic research.
CUSHAW3: Sensitive and Accurate Base-Space and Color-Space Short-Read Alignment with Hybrid Seeding
2014
The majority of next-generation sequencing short-reads can be properly aligned by leading aligners at high speed. However, the alignment quality can still be further improved, since usually not all reads can be correctly aligned to large genomes, such as the human genome, even for simulated data. Moreover, even slight improvements in this area are important but challenging, and usually require significantly more computational endeavor. In this paper, we present CUSHAW3, an open-source parallelized, sensitive and accurate short-read aligner for both base-space and color-space sequences. In this aligner, we have investigated a hybrid seeding approach to improve alignment quality, which incorp…
Direct sequencing from the minimal number of DNA molecules needed to fill a 454 picotiterplate
2014
Notice of Republication: This article was republished on June 17, 2014, to correct an error in the title. The publisher apologizes for the error. In addition, a typographical error was corrected in the Abstract. Please download this article again to view the correct version. The originally published, uncorrected article and the republished, corrected article are provided here for reference.
Next-Generation Sequencing-Based RiboMethSeq Protocol for Analysis of tRNA 2'-O-Methylation.
2016
Analysis of RNA modifications by traditional physico-chemical approaches is labor intensive, requires substantial amounts of input material and only allows site-by-site measurements. The recent development of qualitative and quantitative approaches based on next-generation sequencing (NGS) opens new perspectives for the analysis of various cellular RNA species. The Illumina sequencing-based RiboMethSeq protocol was initially developed and successfully applied for mapping of ribosomal RNA (rRNA) 2'-O-methylations. This method also gives excellent results in the quantitative analysis of rRNA modifications in different species and under varying growth condi…
Compressive biological sequence analysis and archival in the era of high-throughput sequencing technologies
2013
High-throughput sequencing technologies produce large collections of data, mainly DNA sequences with additional information, requiring the design of efficient and effective methodologies for both their compression and storage. In this context, we first provide a classification of the main techniques that have been proposed, according to three specific research directions that have emerged from the literature and, for each, we provide an overview of the current techniques. Finally, to make this review useful to researchers and technicians applying the existing software and tools, we include a synopsis of the main characteristics of the described approaches, including details on their impleme…
unitas: the universal tool for annotation of small RNAs
2017
Abstract Background Next generation sequencing is a key technique in small RNA biology research that has led to the discovery of functionally different classes of small non-coding RNAs in the past years. However, reliable annotation of the extensive amounts of small non-coding RNA data produced by high-throughput sequencing is time-consuming and requires robust bioinformatics expertise. Moreover, existing tools have a number of shortcomings including a lack of sensitivity under certain conditions, limited number of supported species or detectable sub-classes of small RNAs. Results Here we introduce unitas, an out-of-the-box ready software for complete annotation of small RNA sequence datasets, …