Search results for "DATA"

showing 10 items of 12992 documents

Alignment-free sequence comparison using absent words

2018

Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realised by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as $q$-gram distance, are usually computed in time linear with respect to the length of the sequences. In this paper, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an {\em absent word} of some sequence if it does not oc…

0301 basic medicineFOS: Computer and information sciencesFormal Languages and Automata Theory (cs.FL)Computer Science - Formal Languages and Automata TheorySequence alignmentInformation System0102 computer and information sciencesCircular wordAbsent words01 natural sciencesUpper and lower boundsSequence comparisonTheoretical Computer ScienceCombinatorics03 medical and health sciencesComputer Science - Data Structures and AlgorithmsData Structures and Algorithms (cs.DS)Absent wordCircular wordsMathematicsSequenceSettore INF/01 - InformaticaProcess (computing)q-gramComputer Science Applications1707 Computer Vision and Pattern Recognitionq-gramsComposition (combinatorics)Computer Science Applications030104 developmental biologyComputational Theory and MathematicsForbidden words010201 computation theory & mathematicsFocus (optics)Forbidden wordWord (computer architecture)Information SystemsInteger (computer science)
researchProduct

Integrative analysis of structural variations using short-reads and linked-reads yields highly specific and sensitive predictions.

2020

Genetic diseases are driven by aberrations of the human genome. Identification of such aberrations including structural variations (SVs) is key to our understanding. Conventional short-reads whole genome sequencing (cWGS) can identify SVs to base-pair resolution, but utilizes only short-range information and suffers from high false discovery rate (FDR). Linked-reads sequencing (10XWGS) utilizes long-range information by linkage of short-reads originating from the same large DNA molecule. This can mitigate alignment-based artefacts especially in repetitive regions and should enable better prediction of SVs. However, an unbiased evaluation of this technology is not available. In this study, w…

0301 basic medicineFalse discovery rateComputer scienceArtificial Gene Amplification and ExtensionPolymerase Chain ReactionDatabase and Informatics MethodsSequencing techniques0302 clinical medicineBreast TumorsBasic Cancer ResearchMedicine and Health SciencesDNA sequencingBiology (General)EcologyHigh-Throughput Nucleotide SequencingGenomicsDNA Neoplasm3. Good healthIdentification (information)OncologyComputational Theory and MathematicsModeling and SimulationMCF-7 CellsFemaleSequence AnalysisResearch ArticleBioinformaticsQH301-705.5Breast NeoplasmsGenomicsComputational biologyResearch and Analysis MethodsHuman Genomics03 medical and health sciencesCellular and Molecular NeuroscienceCancer GenomicsGenomic MedicineBreast CancerGeneticsDNA Barcoding TaxonomicHumansMolecular Biology TechniquesMolecular BiologyEcology Evolution Behavior and SystematicsWhole genome sequencingLinkage (software)Whole Genome SequencingGenome HumanDideoxy DNA sequencingGenetic Diseases InbornCancers and NeoplasmsBiology and Life SciencesComputational BiologyStatistical modelSequence Analysis DNARepetitive RegionsLogistic Models030104 developmental biologyGenomic Structural VariationHuman genomeSequence Alignment030217 neurology & neurosurgeryPLoS Computational Biology
researchProduct

Biophysics of high density nanometer regions extracted from super-resolution single particle trajectories: application to voltage-gated calcium chann…

2019

AbstractThe cellular membrane is very heterogenous and enriched with high-density regions forming microdomains, as revealed by single particle tracking experiments. However the organization of these regions remain unexplained. We determine here the biophysical properties of these regions, when described as a basin of attraction. We develop two methods to recover the dynamics and local potential wells (field of force and boundary). The first method is based on the local density of points distribution of trajectories, which differs inside and outside the wells. The second method focuses on recovering the drift field that is convergent inside wells and uses the transient field to determine the…

0301 basic medicineField (physics)1.1 Normal biological development and functioningHigh densityBoundary (topology)lcsh:Medicine32 Biomedical and Clinical SciencesLocal field potentialArticleQuantitative Biology::Cell BehaviorQuantitative Biology::Subcellular ProcessesComputational biophysics03 medical and health sciences0302 clinical medicineSingle-molecule biophysics1 Underpinning researchlcsh:SciencePhysicsMultidisciplinary3208 Medical PhysiologyVoltage-dependent calcium channelFOS: Clinical medicinelcsh:RNeurosciencesScientific data030104 developmental biologyParticleNanometrelcsh:QBiological systemBiological physics51 Physical Sciences030217 neurology & neurosurgeryEnergy (signal processing)
researchProduct

Feasibility of sample size calculation for RNA-seq studies

2017

Sample size calculation is a crucial step in study design but is not yet fully established for RNA sequencing (RNA-seq) analyses. To evaluate feasibility and provide guidance, we evaluated RNA-seq sample size tools identified from a systematic search. The focus was on whether real pilot data would be needed for reliable results and on identifying tools that would perform well in scenarios with different levels of biological heterogeneity and fold changes (FCs) between conditions. We used simulations based on real data for tool evaluation. In all settings, the six evaluated tools provided widely different answers, which were strongly affected by FC. Although all tools failed for small FCs, s…

0301 basic medicineFold (higher-order function)Sequence Analysis RNAComputer scienceHigh-Throughput Nucleotide SequencingRNA-Seqcomputer.software_genre03 medical and health sciences030104 developmental biology0302 clinical medicineResearch DesignSample size determinationSample SizeFeasibility StudiesHumansData miningMolecular BiologycomputerSoftware030217 neurology & neurosurgeryInformation SystemsSystematic searchBriefings in Bioinformatics
researchProduct

Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes

2020

In clinical trials, animal and cell line models are often used to evaluate the potential toxic effects of a novel compound or candidate drug before progressing to human trials. However, relating the results of animal and in vitro model exposures to relevant clinical outcomes in the human in vivo system still proves challenging, relying on often putative orthologs. In recent years, multiple studies have demonstrated that the repeated dose rodent bioassay, the current gold standard in the field, lacks sufficient sensitivity and specificity in predicting toxic effects of pharmaceuticals in humans. In this study, we evaluate the potential of deep learning techniques to translate the pattern of …

0301 basic medicineGene ExpressionGene Expression Regulation/drug effectsPathology and Laboratory MedicineConvolutional neural networkTOXICITYMachine LearningVoeding Metabolisme en GenomicaTime Measurement0302 clinical medicineGene expressionMedicine and Health SciencesMeasurementClinical Trials as TopicMultidisciplinaryArtificial neural networkPharmaceuticsQRMetabolism and GenomicsTOXICOGENOMICS030220 oncology & carcinogenesisMetabolisme en GenomicaMedicineEngineering and TechnologyNutrition Metabolism and GenomicsHepatocytes/drug effectsAlgorithmsResearch ArticleComputer and Information SciencesClinical Trials as Topic/statistics & numerical dataNeural NetworksGenetic ToxicologyTOXICOLOGYSciencePredictive ToxicologyComputational biologyBiologyComputer03 medical and health sciencesDose Prediction MethodsDeep LearningVoedingArtificial IntelligenceIn vivoGeneticsLife ScienceAnimalsHumansGeneNutritionbusiness.industryDeep learningBiology and Life SciencesGold standard (test)REPRESENTATIONSRats030104 developmental biologyGene Expression RegulationHepatocytesArtificial intelligenceNeural Networks ComputerToxicogenomicsbusinessNeuroscience
researchProduct

Parallel paleogenomic transects reveal complex genetic history of early European farmers

2017

In European Neolithic populations, the arrival of farmers prompted admixture with local hunter-gatherers over many centuries, resulting in distinct signatures in each region due to a complex series of interactions. David Reich and colleagues analyse genome-wide data from 180 individuals from the Neolithic and Chalcolithic periods of Hungary, Germany and Spain to study the population dynamics of Neolithization in European prehistory. They examine how gene flow reshaped European populations during the Neolithic period, including pervasive admixture—the interbreeding between previously isolated populations—between groups with different ancestry profiles. In each region, they find that the arri…

0301 basic medicineGene FlowMale0106 biological sciencesHuman MigrationPopulation DynamicsPopulationDatasets as Topic010603 evolutionary biology01 natural sciencesArticleGene flowPrehistory03 medical and health sciencesSpatio-Temporal AnalysisGermanyGenetic variationHumansDNA AncienteducationTransectHistory Ancient030304 developmental biologyHungary0303 health scienceseducation.field_of_studyGenetic diversityMultidisciplinaryFarmersHuman migrationbusiness.industryEcologyGenetic VariationChalcolithic030104 developmental biologyAncient DNAGeographySpainPeriod (geology)EthnologyFemalebusiness
researchProduct

MiasDB: A Database of Molecular Interactions Associated with Alternative Splicing of Human Pre-mRNAs.

2016

Alternative splicing (AS) is pervasive in human multi-exon genes and is a major contributor to expansion of the transcriptome and proteome diversity. The accurate recognition of alternative splice sites is regulated by information contained in networks of protein-protein and protein-RNA interactions. However, the mechanisms leading to splice site selection are not fully understood. Although numerous databases have been built to describe AS, molecular interaction databases associated with AS have only recently emerged. In this study, we present a new database, MiasDB, that provides a description of molecular interactions associated with human AS events. This database covers 938 interactions …

0301 basic medicineGene regulatory networklcsh:MedicineRNA-binding proteinRNA-binding proteinscomputer.software_genreBiochemistryHistonesExonDatabase and Informatics MethodsDatabases GeneticProtein Interaction MappingRNA PrecursorsGene Regulatory NetworksDatabase Searchinglcsh:ScienceMultidisciplinaryDatabaseExonsGenomicsGenomic DatabasesNucleic acidsRNA splicingProteomeSequence AnalysisResearch ArticleSequence DatabasesBiologyResponse ElementsResearch and Analysis MethodsGenome Complexity03 medical and health sciencesGeneticsHumansMolecular Biology TechniquesSequencing TechniquesProtein InteractionsGeneMolecular BiologyInternetlcsh:RAlternative splicingIntronBiology and Life SciencesComputational BiologyProteinsGenome AnalysisIntronsAlternative Splicing030104 developmental biologyBiological DatabasesRNA processingRNAlcsh:QRNA Splice SitesGene expressioncomputerProtein KinasesTranscription FactorsPloS one
researchProduct

Common Hits Approach: Combining Pharmacophore Modeling and Molecular Dynamics Simulations.

2017

We present a new approach that incorporates flexibility based on extensive MD simulations of protein-ligand complexes into structure-based pharmacophore modeling and virtual screening. The approach uses the multiple coordinate sets saved during the MD simulations and generates for each frame a pharmacophore model. Pharmacophore models with the same pharmacophore features are pooled. In this way the high number of pharmacophore models that results from the MD simulation is reduced to only a few hundred representative pharmacophore models. Virtual screening runs are performed with every representative pharmacophore model; the screening results are combined and rescored to generate a single hi…

0301 basic medicineGeneral Chemical EngineeringDrug Evaluation PreclinicalLibrary and Information SciencesMolecular Dynamics Simulationcomputer.software_genreLigandsLigandScoutCommon Hits Approach (CHA)03 medical and health sciencesMolecular dynamicsUser-Computer InterfaceComputational chemistryPharmacophore ModelingFlexibility (engineering)Virtual screeningChemistryFrame (networking)ProteinsGeneral ChemistryInto-structureSettore CHIM/08 - Chimica FarmaceuticaComputer Science Applications030104 developmental biologyData miningPharmacophorecomputerJournal of chemical information and modeling
researchProduct

Measuring the clustering effect of BWT via RLE

2017

Abstract The Burrows–Wheeler Transform (BWT) is a reversible transformation on which are based several text compressors and many other tools used in Bioinformatics and Computational Biology. The BWT is not actually a compressor, but a transformation that performs a context-dependent permutation of the letters of the input text that often create runs of equal letters (clusters) longer than the ones in the original text, usually referred to as the “clustering effect” of BWT. In particular, from a combinatorial point of view, great attention has been given to the case in which the BWT produces the fewest number of clusters (cf. [5] , [16] , [21] , [23] ). In this paper we are concerned about t…

0301 basic medicineGeneral Computer SciencePermutationComputer Science (all)Binary number0102 computer and information sciencesQuantitative Biology::Genomics01 natural sciencesUpper and lower boundsTheoretical Computer ScienceCombinatorics03 medical and health sciencesPermutation030104 developmental biologyTransformation (function)BWT010201 computation theory & mathematicsRun-length encodingComputer Science::Data Structures and AlgorithmsCluster analysisPrimitive root modulo nBWT; Permutation; Run-length encoding; Theoretical Computer Science; Computer Science (all)Word (computer architecture)Run-length encodingMathematics
researchProduct

Data on the effects of low iron diet on serum lipid profile in HCV transgenic mouse model

2017

Here, we presented new original data on the effects of iron depletion on the circulating lipid profile in B6HCV mice, a murine model of HCV-related dyslipidemia. Male adult B6HCV mice were subjected to non-invasive iron depletion by low iron diet. Serum iron concentration was assessed for evaluating the effects of the dietary iron depletion. Concentrations of circulating triglycerides, total cholesterol, Low Density Lipoproteins (LDLs), High Density Lipoproteins (HDLs) were analyzed and reported by using stacked line charts. The present data indicated that low serum iron concentration is associated to i) lower serum triglycerides concentrations and ii) increased circulating LDLs. The presen…

0301 basic medicineGenetically modified mousemedicine.medical_specialtyLow density lipoproteins3304High densityLow density lipoproteinlcsh:Computer applications to medicine. Medical informaticsTriglyceride03 medical and health sciences0302 clinical medicineInternal medicinemedicineIron depletion; Low density lipoproteins; Triglycerides; 3304; MultidisciplinarySerum triglycerideslcsh:Science (General)TriglyceridesData ArticleDietary ironMultidisciplinarymedicine.diagnostic_testChemistrymedicine.diseaseIron depletion030104 developmental biologyEndocrinologyBiochemistrySerum ironlcsh:R858-859.7030211 gastroenterology & hepatologylipids (amino acids peptides and proteins)Lipid profileDyslipidemiaIron depletionlcsh:Q1-390Data in Brief
researchProduct