Search results for "GENOME"
showing 10 items of 1913 documents
A Genome-Wide Association Study of Attention Function in a Population-Based Sample of Children
2016
BACKGROUND: Attention function filters and selects behaviorally relevant information. This capacity is impaired in some psychiatric disorders and has been proposed as an endophenotype for Attention-Deficit/Hyperactivity Disorder; however, its genetic basis remains largely unknown. This study aimed to identify single nucleotide polymorphism (SNPs) associated with attention function. MATERIALS AND METHODS: The discovery sample included 1655 children (7-12 years) and the replication sample included 546 children (5-8 years). Five attention outcomes were assessed using the computerized Attentional Network Test (ANT): alerting, orienting, executive attention, Hit Reaction time (HRT) and the stand…
Variants associated with HHIP expression have sex-differential effects on lung function
2021
Background: Lung function is highly heritable and differs between the sexes throughout life. However, little is known about sex-differential genetic effects on lung function. We aimed to conduct the first genome-wide genotype-by-sex interaction study on lung function to identify genetic effects that differ between males and females. Methods: We tested for interactions between 7,745,864 variants and sex on spirometry-based measures of lung function in UK Biobank (N=303,612), and sought replication in 75,696 independent individuals from the SpiroMeta consortium. Results: Five independent single-nucleotide polymorphisms (SNPs) showed genome-wide significant (P<5x10-8) interactions with sex …
Combining multiple hypothesis testing with machine learning increases the statistical power of genome-wide association studies
2016
Mieth, Bettina et al.
Assessing statistical significance in multivariable genome wide association analysis
2016
Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whe…
Two-Stage Bayesian Approach for GWAS With Known Genealogy
2019
Genome-wide association studies (GWAS) aim to assess relationships between single nucleotide polymorphisms (SNPs) and diseases. They are one of the most popular problems in genetics, and have some peculiarities given the large number of SNPs compared to the number of subjects in the study. Individuals might not be independent, especially in animal breeding studies or genetic diseases in isolated populations with highly inbred individuals. We propose a family-based GWAS model in a two-stage approach comprising a dimension reduction and a subsequent model selection. The first stage, in which the genetic relatedness between the subjects is taken into account, selects the promising SNPs. The se…
Evidence for the implication of the histone code in building the genome structure
2018
International audience; Histones are punctuated with small chemical modifications that alter their interaction with DNA. One attractive hypothesis stipulates that certain combinations of these histone modifications may function, alone or together, as a part of a predictive histone code to provide ground rules for chromatin folding. We consider four features that relate histone modifications to chromatin folding: charge neutralisation, molecular specificity, robustness and evolvability. Next, we present evidence for the association among different histone modifications at various levels of chromatin organisation and show how these relationships relate to function such as transcription, repli…
MetaCache: context-aware classification of metagenomic reads using minhashing.
2017
Abstract Motivation Metagenomic shotgun sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes corresponding software tools suffer from either long runtimes, large memory requirements or low accuracy. Results We introduce MetaCache—a novel software for read classification using the big data technique minhashing. Our…
Nature lessons: the whitefly bacterial endosymbiont is a minimal amino acid factory with unusual energetics
2016
Reductive genome evolution is a universal phenomenon observed in endosymbiotic bacteria in insects. As the genome reduces its size and irreversibly losses coding genes, the functionalities of the cell system, including the energetics processes, are more restricted. Several energetic pathways can also be lost. How do these reduced metabolic networks sustain the energy needs of the system? Among the bacteria with reduced genomes Candidatus Portiera aleyrodidarum, obligate endosymbiont of whiteflies, represents an extreme case since lacks several key mechanisms for ATP generation. Thus, to analyze the cell energetics in this system, a genome-scale metabolic model of this endosymbiont was const…
panISa: ab initio detection of insertion sequences in bacterial genomes from short read sequence data.
2018
Abstract Motivation The advent of next-generation sequencing has boosted the analysis of bacterial genome evolution. Insertion sequence (IS) elements play a key role in prokaryotic genome organization and evolution, but their repetitions in genomes complicate their detection from short-read data. Results PanISa is a software pipeline that identifies IS insertions ab initio in bacterial genomes from short-read data. It is a highly sensitive and precise tool based on the detection of read-mapping patterns at the insertion site. PanISa performs better than existing IS detection systems as it is based on a database-free approach. We applied it to a high-risk clone lineage of the pathogenic spec…
The intrinsic combinatorial organization and information theoretic content of a sequence are correlated to the DNA encoded nucleosome organization of…
2015
Abstract Motivation: Thanks to research spanning nearly 30 years, two major models have emerged that account for nucleosome organization in chromatin: statistical and sequence specific. The first is based on elegant, easy to compute, closed-form mathematical formulas that make no assumptions of the physical and chemical properties of the underlying DNA sequence. Moreover, they need no training on the data for their computation. The latter is based on some sequence regularities but, as opposed to the statistical model, it lacks the same type of closed-form formulas that, in this case, should be based on the DNA sequence only. Results: We contribute to close this important methodological gap …