Search results for "genomics"
showing 10 items of 1255 documents
Efficient change point detection in genomic sequences of continuous measurements
2010
Abstract Motivation: Knowing the exact locations of multiple change points in genomic sequences serves several biological needs, for instance when data represent aCGH profiles and it is of interest to identify possibly damaged genes involved in cancer and other diseases. Only a few of the currently available methods deal explicitly with estimation of the number and location of change points, and moreover these methods may be somewhat vulnerable to deviations of model assumptions usually employed. Results: We present a computationally efficient method to obtain estimates of the number and location of the change points. The method is based on a simple transformation of data and it provides re…
Additional file 4 of Development and validation of prediction model to estimate 10-year risk of all-cause mortality using modern statistical learning…
2021
Additional file 4. Distributions of the variables at baseline before and after multiple imputations.
ChIP-Seq from Limited Starting Material of K562 Cells and Drosophila Neuroblasts Using Tagmentation Assisted Fragmentation Approach
2019
Chromatin immunoprecipitation is extensively used to investigate the epigenetic profile and transcription factor binding sites in the genome. However, when the starting material is limited, the conventional ChIP-Seq approach cannot be implemented. This protocol describes a method that can be used to generate the chromatin profiles from as low as 100 human or 1,000 Drosophila cells. The method employs tagmentation to fragment the chromatin with concomitant addition of sequencing adaptors. The method generates datasets with high signal to noise ratio and can be subjected to standard tools for ChIP-Seq analysis.
Animal rennets as sources of dairy lactic acid bacteria
2014
ABSTRACT The microbial composition of artisan and industrial animal rennet pastes was studied by using both culture-dependent and -independent approaches. Pyrosequencing targeting the 16S rRNA gene allowed to identify 361 operational taxonomic units (OTUs) to the genus/species level. Among lactic acid bacteria (LAB), Streptococcus thermophilus and some lactobacilli, mainly Lactobacillus crispatus and Lactobacillus reuteri , were the most abundant species, with differences among the samples. Twelve groups of microorganisms were targeted by viable plate counts revealing a dominance of mesophilic cocci. All rennets were able to acidify ultrahigh-temperature-processed (UHT) milk as shown by pH …
One is not enough: On the effects of reference genome for the mapping and subsequent analyses of short-reads.
2020
Mapping of high-throughput sequencing (HTS) reads to a single arbitrary reference genome is a frequently used approach in microbial genomics. However, the choice of a reference may represent a source of errors that may affect subsequent analyses such as the detection of single nucleotide polymorphisms (SNPs) and phylogenetic inference. In this work, we evaluated the effect of reference choice on short-read sequence data from five clinically and epidemiologically relevant bacteria (Klebsiella pneumoniae, Legionella pneumophila, Neisseria gonorrhoeae, Pseudomonas aeruginosa and Serratia marcescens). Publicly available whole-genome assemblies encompassing the genomic diversity of these species…
Focal DNA Copy Number Changes in Neuroblastoma Target MYCN Regulated Genes
2013
Neuroblastoma is an embryonic tumor arising from immature sympathetic nervous system cells. Recurrent genomic alterations include MYCN and ALK amplification as well as recurrent patterns of gains and losses of whole or large partial chromosome segments. A recent whole genome sequencing effort yielded no frequently recurring mutations in genes other than those affecting ALK. However, the study further stresses the importance of DNA copy number alterations in this disease, in particular for genes implicated in neuritogenesis. Here we provide additional evidence for the importance of focal DNA copy number gains and losses, which are predominantly observed in MYCN amplified tumors. A focal 5 kb…
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut
2013
Abstract Background The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. Results We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different ass…
The histone deacetylase Rpd3 regulates the heterochromatin structure of Drosophila telomeres
2011
Telomeres are specialized structures at the end of eukaryotic chromosomes that are required to preserve genome integrity, chromosome stability and nuclear architecture. Telomere maintenance and function are established epigenetically in several eukaryotes. However, the exact chromatin enzymatic modifications regulating telomere homeostasis are poorly understood. In Drosophila melanogaster, telomere length and stability are maintained through the retrotransposition of specialized telomeric sequences and by the specific loading of protecting capping proteins, respectively. Here, we show that the loss of the essential and evolutionarily conserved histone deacetylase Rpd3, the homolog of mammal…
Whole genome sequencing of the black grouse (Tetrao tetrix): reference guided assembly suggests faster-Z and MHC evolution
2014
Background The different regions of a genome do not evolve at the same rate. For example, comparative genomic studies have suggested that the sex chromosomes and the regions harbouring the immune defence genes in the Major Histocompatability Complex (MHC) may evolve faster than other genomic regions. The advent of the next generation sequencing technologies has made it possible to study which genomic regions are evolutionary liable to change and which are static, as well as enabling an increasing number of genome studies of non-model species. However, de novo sequencing of the whole genome of an organism remains non-trivial. In this study, we present the draft genome of the black grouse, wh…
Evaluation of GPU-based Seed Generation for Computational Genomics Using Burrows-Wheeler Transform
2012
Unprecedented production of short reads from the new high-throughput sequencers has posed challenges to align short reads to reference genomes with high sensitivity and high speed. Many CPU-based short read aligners have been developed to address this challenge. Among them, one popular approach is the seed-and-extend heuristic. For this heuristic, the first and foremost step is to generate seeds between the input reads and the reference genome, where hash tables are the most frequently used data structure. However, hash tables are memory-consuming, making it not well-suited to memory-stringent many-core architectures, like GPUs, even though they usually have a nearly constant query time com…