Search results for "IMPUTATION"

showing 10 items of 57 documents

Interpretable machine learning models for single-cell ChIP-seq imputation

2019

AbstractMotivationSingle-cell ChIP-seq (scChIP-seq) analysis is challenging due to data sparsity. High degree of data sparsity in biological high-throughput single-cell data is generally handled with imputation methods that complete the data, but specific methods for scChIP-seq are lacking. We present SIMPA, a scChIP-seq data imputation method leveraging predictive information within bulk data from ENCODE to impute missing protein-DNA interacting regions of target histone marks or transcription factors.ResultsImputations using machine learning models trained for each single cell, each target, and each genomic region accurately preserve cell type clustering and improve pathway-related gene i…

Computer sciencebusiness.industryCell chipPython (programming language)Machine learningcomputer.software_genreENCODEIdentification (information)Simulated dataFeature (machine learning)Imputation (statistics)Artificial intelligenceCluster analysisbusinesscomputercomputer.programming_language
researchProduct

Regression imputation for Space-Time datasets with missing values

2009

Data consisting in repeated observations on a series of fixed units are very common in different context like biological, environmental and social sciences, and different terminology is often used to indicate this kind of data: panel data, longitudinal data, time series-cross section data (TSCS), spatio-temporal data. Missing information are inevitable in longitudinal studies, and can produce biased estimates and loss of powers. The aim of this paper is to propose a new regression (single) imputation method that, considering the particular structure and characteristics of the data set, creates a “complete” data set that can be analyzed by any researcher on different occasions and using diff…

Cross-sectional dataSpace timeMissing datacomputer.software_genreRegressionTerminologyGeographyStatisticsSpace-time data imputationPerformance indicatorImputation (statistics)Data miningSettore SECS-S/01 - StatisticacomputerPanel data
researchProduct

Genome-Wide Haplotype Analysis of Cis Expression Quantitative Trait Loci in Monocytes

2013

In order to assess whether gene expression variability could be influenced by several SNPs acting in cis, either through additive or more complex haplotype effects, a systematic genome-wide search for cis haplotype expression quantitative trait loci (eQTL) was conducted in a sample of 758 individuals, part of the Cardiogenics Transcriptomic Study, for which genome-wide monocyte expression and GWAS data were available. 19,805 RNA probes were assessed for cis haplotypic regulation through investigation of ∼2,1×109 haplotypic combinations. 2,650 probes demonstrated haplotypic p-values >104-fold smaller than the best single SNP p-value. Replication of significant haplotype effects were tested f…

Cancer Researchmedicine.medical_specialtyHereditylcsh:QH426-470Immune Cells[SDV]Life Sciences [q-bio]Quantitative Trait LociImmunologyGene ExpressionGenome-wide association studySingle-nucleotide polymorphismQuantitative trait locusBiologyRegulatory Sequences Nucleic AcidPolymorphism Single NucleotideMonocytes03 medical and health sciences0302 clinical medicineMolecular geneticsmedicineGeneticsGenome-Wide Association StudiesSNPHumansGenetic Predisposition to DiseaseMolecular BiologyBiologyGenetics (clinical)Ecology Evolution Behavior and Systematics030304 developmental biologyGenetics0303 health sciencesQuantitative TraitsComplex TraitsHaplotypeGenomicslcsh:GeneticsGene Expression RegulationHaplotypesExpression quantitative trait lociGenome Expression Analysis030217 neurology & neurosurgeryImputation (genetics)Population GeneticsGenome-Wide Association StudyResearch Article
researchProduct

Air quality and integration of short-term and long-term pollutant data

2008

Modelling PM10 is an important problem in statistical methodology, above all to explain the PM10 behaviour in space and time, since it has been linked to many adverse effects on human and environmental health. But the large spatial variability of the main traffic-related pollutants, and in particular here the PM10, implies the impossibility of obtaining from the data of the fixed stations a complete pictures of the atmospheric pollution in the urban areas. Information from fixed monitoring stations (long-term measurements) are therefore integrated with the ones deriving from mobile station (short-term measurements). Short-term measurements are incomplete and so it is necessary to integrate …

Settore SECS-S/01 - StatisticaPollution short-term series PM10 missing values single imputation method
researchProduct

2015

Hearing loss and individual differences in normal hearing both have a substantial genetic basis. Although many new genes contributing to deafness have been identified, very little is known about genes/variants modulating the normal range of hearing ability. To fill this gap, we performed a two-stage meta-analysis on hearing thresholds (tested at 0.25, 0.5, 1, 2, 4, 8 kHz) and on pure-tone averages (low-, medium- and high-frequency thresholds grouped) in several isolated populations from Italy and Central Asia (total N = 2636). Here, we detected two genome-wide significant loci close to PCDH20 and SLC28A3 (top hits: rs78043697, P = 4.71E-10 and rs7032430, P = 2.39E-09, respectively). For bot…

Genetics0303 health sciencesSequence analysisHearing lossGenome-wide association studySingle-nucleotide polymorphismGeneral MedicineBiologyGenome03 medical and health sciences0302 clinical medicineGenotypeotorhinolaryngologic diseasesGeneticsmedicinemedicine.symptomMolecular BiologyGene030217 neurology & neurosurgeryGenetics (clinical)Imputation (genetics)030304 developmental biologyHuman Molecular Genetics
researchProduct

Bayesian models for data missing not at random in health examination surveys

2018

In epidemiological surveys, data missing not at random (MNAR) due to survey nonresponse may potentially lead to a bias in the risk factor estimates. We propose an approach based on Bayesian data augmentation and survival modelling to reduce the nonresponse bias. The approach requires additional information based on follow-up data. We present a case study of smoking prevalence using FINRISK data collected between 1972 and 2007 with a follow-up to the end of 2012 and compare it to other commonly applied missing at random (MAR) imputation approaches. A simulation experiment is carried out to study the validity of the approaches. Our approach appears to reduce the nonresponse bias substantially…

Statistics and ProbabilityFOS: Computer and information sciencesmedicine.medical_specialtymultiple imputationComputer scienceBayesian probability01 natural sciencesStatistics - Applicationssurvival analysisfollow-up dataMethodology (stat.ME)010104 statistics & probability03 medical and health sciencesHealth examination0302 clinical medicineEpidemiologyStatisticsmedicineApplications (stat.AP)030212 general & internal medicine0101 mathematicsSurvival analysisStatistics - MethodologyBayes estimatorta112elinaika-analyysiRisk factor (computing)Bayesian estimation3. Good healthhealth examination surveysStatistics Probability and UncertaintyMissing not at randomdata augmentation
researchProduct

Item nonresponse and imputation strategies in SHARE Wave 5

2015

This chapter focuses on item nonresponse in the fifth wave of SHARE and the imputation strategies adopted to fill-in the missing values.

SHARE; Item nonresponse; Imputation strategiesSHARESettore SECS-P/05 - EconometriaImputation strategiesItem nonresponse
researchProduct

Identification of patterns og change on mongitudinal data, illustrated by two exemples : study of hospital pathways in the management of cancer. Cons…

2014

Context In healthcare domain, data mining for knowledge discovery represent a growing issue. Questions about the organisation of healthcare system and the study of the relation between treatment and quality of life (QoL) perceived could be addressed that way. The evolution of technologies provides us with efficient data mining tools and statistical packages containing advanced methods available for non-experts. We illustrate this approach through two issues: 1 / What organisation of healthcare system for cancer diseases management? 2 / Exploring in patients suffering from metastatic cancer, the relationship between health-related QoL perceived and treatment received as part of a clinical tr…

Quality of lifeQualité de viesTrajectoire de soins[SDV.MHEP] Life Sciences [q-bio]/Human health and pathologyMultiple imputationImputation de donnéesFouille de donnéesClassificationCancersData miningTrajectory of careClusteringCancer
researchProduct

Comparison of HapMap and 1000 genomes reference panels in a large-scale genome-wide association study

2017

An increasing number of genome-wide association (GWA) studies are now using the higher resolution 1000 Genomes Project reference panel (1000G) for imputation, with the expectation that 1000G imputation will lead to the discovery of additional associated loci when compared to HapMap imputation. In order to assess the improvement of 1000G over HapMap imputation in identifying associated loci, we compared the results of GWA studies of circulating fibrinogen based on the two reference panels. Using both HapMap and 1000G imputation we performed a meta-analysis of 22 studies comprising the same 91,953 individuals. We identified six additional signals using 1000G imputation, while 29 loci were ass…

Netherlands Twin Register (NTR)0301 basic medicineGlycobiologySocial Scienceslcsh:MedicineGenome-wide association study030105 genetics & heredityBiochemistryMathematical and Statistical TechniquesSociologyCell SignalingConsortiaGENETIC-VARIANTSMedicine and Health SciencesIMPUTATIONInternational HapMap Projectlcsh:ScienceGeneticsMultidisciplinaryCOMMON VARIANTSGenomicsMultidisciplinary SciencesINSIGHTSCARDIOVASCULAR-DISEASEPhysical SciencessymbolsScience & Technology - Other TopicsHealth Services ResearchGenomic Signal ProcessingStatistics (Mathematics)Research ArticleSignal TransductionGenotypingSUSCEPTIBILITY LOCIGeneral Science & TechnologyBIOLOGYSingle-nucleotide polymorphismGenomicsHapMap ProjectComputational biologyPRESSUREBiologyResearch and Analysis Methods03 medical and health sciencessymbols.namesakeMD MultidisciplinaryGenome-Wide Association StudiesGeneticsJournal Article/dk/atira/pure/keywords/cohort_studies/netherlands_twin_register_ntr_HumansStatistical Methods1000 Genomes ProjectMolecular Biology TechniquesMolecular BiologyMETAANALYSISGlycoproteinsScience & Technologylcsh:RHuman GenomeCONSORTIUMBiology and Life SciencesComputational BiologyFibrinogenHuman GeneticsCell BiologyComparative GenomicsGenome AnalysisHealth Care030104 developmental biologyBonferroni correctionlcsh:QHaplotype estimationMathematicsImputation (genetics)Meta-AnalysisGenome-Wide Association Study
researchProduct

Genome-wide Analyses Identify KIF5A as a Novel ALS Gene

2018

© 2018 Elsevier Inc.

MaleAls geneGenome-wide association studyFAMILIAL ALSALS; axonal transport; cargo; GWAS; KIF5A; WES; WGS0302 clinical medicine80 and overPsychologyGWASKIF5AAetiologycargoAged 80 and over0303 health sciencesFrench ALS ConsortiumKinesinKINESIN HEAVY-CHAINCognitive Sciencesaxonal transportHumanHereditary spastic paraplegiaNeuroscience(all)Single-nucleotide polymorphismTARGETED DISRUPTIONArticle03 medical and health sciencesGeneticsHumansAmino Acid SequenceLoss functionAgedHEXANUCLEOTIDE REPEATNeuroscience (all)MUTATIONSAmyotrophic Lateral Sclerosis3112 Neurosciences1702 Cognitive Sciencemedicine.diseaseITALSGEN ConsortiumAnswer ALS Foundation030104 developmental biologyALS Sequencing ConsortiumHuman medicine1109 Neurosciences030217 neurology & neurosurgery0301 basic medicineALS; GWAS; KIF5A; WES; WGS; axonal transport; cargo[SDV]Life Sciences [q-bio]KinesinsNeurodegenerativeGenetic analysisGenomeAMYOTROPHIC-LATERAL-SCLEROSIS3124 Neurology and psychiatryCohort StudiesPathogenesisLoss of Function MutationMissense mutation2.1 Biological and endogenous factorsAmyotrophic lateral sclerosisNYGC ALS ConsortiumGeneticsGeneral NeuroscienceALS axonal transport cargo GWAS KIF5A WES WGSMiddle AgedPhenotypeSettore MED/26 - NEUROLOGIANeurologicalProject MinE ALS Sequencing ConsortiumKinesinWESFemaleAdultBiologyGENOTYPE IMPUTATIONALS; axonal transport; cargo; GWAS; KIF5A; WES; WGS; Adult; Aged; Aged 80 and over; Amino Acid Sequence; Amyotrophic Lateral Sclerosis; Cohort Studies; Female; Genome-Wide Association Study; Humans; Kinesin; Loss of Function Mutation; Male; Middle Aged; Young AdultNOYoung AdultRare DiseasesmedicineSLAGEN ConsortiumGene030304 developmental biologyClinical Research in ALS and Related Disorders for Therapeutic Development (CReATe) ConsortiumNeurology & NeurosurgeryHuman GenomeNeurosciencesAXONAL-TRANSPORTBrain DisordersALS; axonal transport; cargo; GWAS; KIF5A; WES; WGS;Family memberDNA-DAMAGEMOTOR-NEURONS3111 BiomedicineCohort StudieALSGenomic Translation for ALS Care (GTAC) ConsortiumWGSAmyotrophic Lateral SclerosiGenome-Wide Association StudyALS; axonal transport; cargo; GWAS; KIF5A; WES; WGS; Neuroscience (all)
researchProduct