Search results for "riso"

showing 10 items of 1451 documents

Efficient Algorithms for Sequence Analysis with Entropic Profiles

2017

Entropy, being closely related to repetitiveness and compressibility, is a widely used information-related measure to assess the degree of predictability of a sequence. Entropic profiles are based on information theory principles, and can be used to study the under-/over-representation of subwords, by also providing information about the scale of conserved DNA regions. Here, we focus on the algorithmic aspects related to entropic profiles. In particular, we propose linear time algorithms for their computation that rely on suffix-based data structures, more specifically on the truncated suffix tree (TST) and on the enhanced suffix array (ESA). We performed an extensive experimental campaign …

0301 basic medicine; Compressed suffix array; Theoretical computer science; Entropy; Suffix tree; 0206 medical engineering; Generalized suffix tree; 02 engineering and technology; String searching algorithm; Information theory; law.invention; 03 medical and health sciences; law; Genetics; Animals; Humans; Mathematics; Applied Mathematics; Suffix array; Computational Biology; DNA; Sequence Analysis DNA; Data structure; 030104 developmental biology; Suffix; Alignment free Entropy Sequence analysis Sequence comparison; Algorithms; 020602 bioinformatics; Biotechnology; IEEE/ACM Transactions on Computational Biology and Bioinformatics
researchProduct

Parallel Pairwise Epistasis Detection on Heterogeneous Computing Architectures

2016

This is a post-peer-review, pre-copyedit version of an article published in IEEE Transactions on Parallel and Distributed Systems. The final authenticated version is available online at: http://dx.doi.org/10.1109/TPDS.2015.2460247. [Abstract] Development of new methods to detect pairwise epistasis, such as SNP-SNP interactions, in Genome-Wide Association Studies is an important task in bioinformatics as they can help to explain genetic influences on diseases. As these studies are time-consuming operations, some tools exploit the characteristics of different hardware accelerators (such as GPUs and Xeon Phi coprocessors) to reduce the runtime. Nevertheless, all these approaches are not able t…

0301 basic medicine; Coprocessor; Computer science; 0206 medical engineering; Acceleration; Data models; Symmetric multiprocessor system; Computational modeling; 02 engineering and technology; Parallel computing; Supercomputer; 03 medical and health sciences; Task (computing); 030104 developmental biology; Coprocessors; Computational Theory and Mathematics; Hardware and Architecture; Signal Processing; Genetics; Pairwise comparison; Computer architecture; Graphics processing units; 020602 bioinformatics; Xeon Phi
researchProduct

Inferring causation from time series in earth system sciences

2019

The heart of the scientific enterprise is a rational effort to understand the causes behind the phenomena we observe. In large-scale complex dynamical systems such as the Earth system, real experiments are rarely feasible. However, a rapidly increasing amount of observational and simulated data opens up the use of novel data-driven causal methods beyond the commonly adopted correlation techniques. Here, we give an overview of causal inference frameworks and identify promising generic application cases common in Earth system sciences and beyond. We discuss challenges and initiate the benchmark platform causeme.net to close the gap between method users and developers.

0301 basic medicine; Earth science; Aquatic Ecology and Water Quality Management; Dynamical systems theory; Computer science; 530 Physics; Datenmanagement und Analyse; Science; review; General Physics and Astronomy; heart; 02 engineering and technology; General Biochemistry Genetics and Molecular Biology; 03 medical and health sciences; Databases; Life Science; Causation; Statistical physics thermodynamics and nonlinear dynamics; intermethod comparison; lcsh:Science; research work; Scientific enterprise; Multidisciplinary; WIMEK; Series (mathematics); Q; Computational science; feasibility study; 500; General Chemistry; Aquatische Ecologie en Waterkwaliteitsbeheer; simulation; 021001 nanoscience & nanotechnology; Data science; causal inference climate; Earth system science; Environmental sciences; 030104 developmental biology; time series analysis; Causal inference; Perspective; Benchmark (computing); Observational study; lcsh:Q; conceptual framework; data management; 0210 nano-technology; Climate sciences
researchProduct

The colored longest common prefix array computed via sequential scans

2018

Due to the increased availability of large datasets of biological sequences, the tools for sequence comparison are now relying on efficient alignment-free approaches to a greater extent. Most of the alignment-free approaches require the computation of statistics of the sequences in the dataset. Such computations become impractical in internal memory when very large collections of long sequences are considered. In this paper, we present a new conceptual data structure, the colored longest common prefix array (cLCP), that allows several problems to be tackled efficiently with an alignment-free approach. In fact, we show that such a data structure can be computed via sequential scans in semi-exter…

0301 basic medicine; FOS: Computer and information sciences; Alignment-free methods; Burrows–Wheeler transform; Computer science; Computation; Average common substring; 0206 medical engineering; Matching statistics; Scale (descriptive set theory); 02 engineering and technology; Theoretical Computer Science; 03 medical and health sciences; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); Burrows-wheeler transform; String (computer science); Computer Science (all); LCP array; Matching statistic; Data structure; Substring; 030104 developmental biology; Pairwise comparison; Longest common prefix; Algorithm; 020602 bioinformatics; Alignment-free method
researchProduct

Alignment-free sequence comparison using absent words

2018

Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realised by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as q-gram distance, are usually computed in time linear with respect to the length of the sequences. In this paper, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an absent word of some sequence if it does not oc…

0301 basic medicine; FOS: Computer and information sciences; Formal Languages and Automata Theory (cs.FL); Computer Science - Formal Languages and Automata Theory; Sequence alignment; Information System; 0102 computer and information sciences; Circular word; Absent words; 01 natural sciences; Upper and lower bounds; Sequence comparison; Theoretical Computer Science; Combinatorics; 03 medical and health sciences; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); Absent word; Circular words; Mathematics; Sequence; Settore INF/01 - Informatica; Process (computing); q-gram; Computer Science Applications1707 Computer Vision and Pattern Recognition; q-grams; Composition (combinatorics); Computer Science Applications; 030104 developmental biology; Computational Theory and Mathematics; Forbidden words; 010201 computation theory & mathematics; Focus (optics); Forbidden word; Word (computer architecture); Information Systems; Integer (computer science)
researchProduct

Conjugative ESBL plasmids differ in their potential to rescue susceptible bacteria via horizontal gene transfer in lethal antibiotic concentrations.

2017

0301 basic medicine; Gene Transfer Horizontal; medicine.drug_class; Antiparasitic; 030106 microbiology; Antibiotics; Gene transfer; Drug resistance; Biology; beta-Lactamases; Microbiology; 03 medical and health sciences; plasmidit; Plasmid; Drug Discovery; Drug Resistance Bacterial; polycyclic compounds; medicine; Escherichia coli; Humans; antimicrobial resistance; Escherichia coli Infections; Pharmacology; ta1182; biochemical phenomena metabolism and nutrition; bacterial infections and mycoses; biology.organism_classification; Glycopeptide; 3. Good health; Anti-Bacterial Agents; bacterial conjugation; Horizontal gene transfer; horizontal gene transfer; horisontaalinen geeninsiirto; Bacteria; Plasmids; The Journal of antibiotics
researchProduct

Linear-time sequence comparison using minimal absent words & applications

2016

Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realized by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as q-gram distance, are usually computed in time linear with respect to the length of the sequences. In this article, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an absent word of some sequence if it does not occur in…

0301 basic medicine; Latin Americans; Computer Science (all); Library science; 0102 computer and information sciences; Circular word; Algorithms on string; 01 natural sciences; Alignmentfree comparison; Sequence comparison; Theoretical Computer Science; 03 medical and health sciences; 030104 developmental biology; 010201 computation theory & mathematics; Informatics; Political science; Absent word; Forbidden word
researchProduct

Improvement of a rapid direct blood culture microbial identification protocol using MALDI-TOF MS and performance comparison with SepsiTyper kit

2018

Fast diagnosis of pathogens is critical to guarantee the most adequate therapy for infections; bacterial culture methods, which constitute the current gold standard, are precise and sensitive but rather slow. Today, new methods have been made available to enable faster diagnosis, with the Matrix-Assisted Laser Desorption Ionization-Time Of Flight Mass Spectrometry (MALDI-TOF MS) technique being the most promising. Even if simpler and faster than traditional bacterial culture methods, analysis of positive blood cultures via MALDI-TOF MS requires a preliminary extraction process of samples. In this study, we compared two extraction protocols for bacterial identification directly from positive …

0301 basic medicine; Microbiology (medical); Time Factors; Computer science; 030106 microbiology; Bacteremia; Clinical diagnostic laboratory; Sensitivity and Specificity; Microbiology; 03 medical and health sciences; Species Specificity; medicine; Humans; Blood culture; Overall performance; Molecular Biology; Protocol (science); Bacteriological Techniques; Chromatography; Bacteria; medicine.diagnostic_test; Diagnostic Tests Routine; Gold standard (test); Matrix-assisted laser desorption/ionization; Identification (information); Blood Culture; Pathogens identification; Spectrometry Mass Matrix-Assisted Laser Desorption-Ionization; Performance comparison; Costs and Cost Analysis; Genus and species identification; Matrix-assisted laser desorption ionization time of flight mass spectrometry; Journal of Microbiological Methods
researchProduct

Evolving Notch polyQ tracts reveal possible solenoid interference elements.

2016

Polyglutamine (polyQ) tracts in regulatory proteins are extremely polymorphic. As functional elements under selection for length, triplet repeats are prone to DNA replication slippage and indel mutations. Many polyQ tracts are also embedded within intrinsically disordered domains, which are less constrained, fast evolving, and difficult to characterize. To identify structural principles underlying polyQ tracts in disordered regulatory domains, here I analyze deep evolution of metazoan Notch polyQ tracts, which can generate alleles causing developmental and neurogenic defects. I show that Notch features polyQ tract turnover that is restricted to a discrete number of conserved “polyQ …

0301 basic medicine; Models Molecular; Protein Structure Comparison; Protein Folding; Huntingtin; lcsh:Medicine; Carboxamide; Ankyrin Repeat Domain; Biochemistry; Protein Structure Secondary; Database and Informatics Methods; 0302 clinical medicine; Protein structure; Macromolecular Structure Analysis; Drosophila Proteins; lcsh:Science; Genetics; Huntingtin Protein; Multidisciplinary; Receptors Notch; Chemistry; Drosophila Melanogaster; Animal Models; Cell biology; Insects; Experimental Organism Systems; Protein folding; Drosophila; Sequence Analysis; Research Article; Multiple Alignment Calculation; Protein Structure; Arthropoda; medicine.drug_class; Bioinformatics; Protein domain; Sequence alignment; Biology; Intrinsically disordered proteins; Research and Analysis Methods; Terminal loop; Evolution Molecular; 03 medical and health sciences; Model Organisms; Protein Domains; Sequence Motif Analysis; Computational Techniques; medicine; Huntingtin Protein; Animals; Indel; Molecular Biology; Repetitive Sequences Nucleic Acid; Models Genetic; Sequence Homology Amino Acid; lcsh:R; DNA replication; Organisms; Biology and Life Sciences; Proteins; Hydrogen Bonding; Invertebrates; Split-Decomposition Method; Intrinsically Disordered Proteins; 030104 developmental biology; Ankyrin repeat; lcsh:Q; Peptides; Sequence Alignment; 030217 neurology & neurosurgery; PLoS ONE
researchProduct

A multicentre analytical comparison study of inter-reader and inter-assay agreement of four programmed death-ligand 1 immunohistochemistry assays for…

2020

AIMS: Studies in various cancer types have demonstrated discordance between results from different programmed death-ligand 1 (PD-L1) assays. Here, we compare the reproducibility and analytical concordance of four clinically developed assays for assessing PD-L1-positivity in tumour-infiltrating immune cells in the tumour area (PD-L1-IC-positivity) in triple-negative breast cancer (TNBC). METHODS AND RESULTS: Primary TNBC resection specimens (n = 30) were selected based on their PD-L1-IC-positivity per VENTANA SP142 ( 5%: eight cases). Serial histological sections were stained for PD-L1 using VENTANA SP142, VENTANA SP263, DAKO 22C3 and DAKO 28-8. PD-L1-IC-positivity and tumour cell expression (…

0301 basic medicine; Oncology; Male; medicine.medical_specialty; Histology; Concordance; Triple Negative Breast Neoplasms; B7-H1 Antigen; Pathology and Forensic Medicine; Cohort Studies; 03 medical and health sciences; 0302 clinical medicine; Breast cancer; Lymphocytes Tumor-Infiltrating; Internal medicine; medicine; Biomarkers Tumor; Humans; Triple-negative breast cancer; Aged; Reproducibility; Whole Genome Sequencing; business.industry; Cancer; High-Throughput Nucleotide Sequencing; Reproducibility of Results; General Medicine; Middle Aged; medicine.disease; Immunohistochemistry; ddc:; 030104 developmental biology; 030220 oncology & carcinogenesis; Mutation; Comparison study; Immunohistochemistry; Female; Neoplasm Grading; business; Programmed death; Histopathology; References
researchProduct