Search results for "Computer and Information Science"

showing 10 items of 1335 documents

Measuring spectrally-resolved information transfer.

2020

Information transfer, measured by transfer entropy, is a key component of distributed computation. It is therefore important to understand the pattern of information transfer in order to unravel the distributed computational algorithms of a system. Since distributed computation in many natural systems is thought to rely on rhythmic processes, a frequency-resolved measure of information transfer is highly desirable. Here, we present a novel algorithm, and its efficient implementation, to separately identify the frequencies sending and receiving information in a network. Our approach relies on the invertible maximum overlap discrete wavelet transform (MODWT) for the creation of surrogate data in t…

0301 basic medicine; Discrete wavelet transform; Information transfer; Computer science; Entropy; Information Theory; 0302 clinical medicine; Wavelet; Mathematical and Statistical Techniques; Medicine and Health Sciences; Biology (General); Wavelet Transforms; Temporal cortex; Mammals; Ecology; Systems Biology; Applied Mathematics; Simulation and Modeling; Physics; Wavelet transform; Magnetoencephalography; Eukaryota; Brain; Signal Filtering; Computational Theory and Mathematics; Modeling and Simulation; Physical Sciences; Vertebrates; Thermodynamics; Engineering and Technology; Wavelet transforms; Algorithms; Magnetoencephalography; Information entropy; Signal filtering; Ferrets; Permutation; Entropy; Anatomy; Algorithm; Information Entropy; Algorithms; Research Article; Computer and Information Sciences; QH301-705.5; Permutation; Wavelet Analysis; Prefrontal Cortex; Research and Analysis Methods; 03 medical and health sciences; Cellular and Molecular Neuroscience; Genetics; Entropy (information theory); Animals; Humans; Information flow (information theory); Molecular Biology; Ecology Evolution Behavior and Systematics; Discrete Mathematics; Ferrets; Organisms; Biology and Life Sciences; 030104 developmental biology; Combinatorics; Signal Processing; Amniotes; Transfer entropy; Zoology; Mathematical Functions; 030217 neurology & neurosurgery; Mathematics; PLoS computational biology
researchProduct

Detecting mutations by eBWT

2018

In this paper we develop a theory describing how the extended Burrows-Wheeler Transform (eBWT) of a collection of DNA fragments tends to cluster together the copies of nucleotides sequenced from a genome G. Our theory accurately predicts how many copies of any nucleotide are expected inside each such cluster, and how an elegant and precise LCP-array-based procedure can locate these clusters in the eBWT. Our findings are very general and can be applied to a wide range of different problems. In this paper, we consider the case of alignment-free and reference-free SNP discovery in multiple collections of reads. We note that, in accordance with our theoretical results, SNPs are clustered in th…

0301 basic medicine; FOS: Computer and information sciences; 000 Computer science knowledge general works; BWT LCP Array SNPs Reference-free Assembly-free; LCP Array; Settore INF/01 - Informatica; [SDV]Life Sciences [q-bio]; Reference-free; Assembly-free; SNP; 03 medical and health sciences; 030104 developmental biology; BWT; BWT; LCP Array; SNPs; Reference-free; Assembly-free; Computer Science; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); [INFO]Computer Science [cs]; Software; SNPs
researchProduct

The colored longest common prefix array computed via sequential scans

2018

Due to the increased availability of large datasets of biological sequences, tools for sequence comparison now rely to a greater extent on efficient alignment-free approaches. Most alignment-free approaches require the computation of statistics of the sequences in the dataset. Such computations become impractical in internal memory when very large collections of long sequences are considered. In this paper, we present a new conceptual data structure, the colored longest common prefix array (cLCP), that allows several problems to be tackled efficiently with an alignment-free approach. In fact, we show that such a data structure can be computed via sequential scans in semi-exter…

0301 basic medicine; FOS: Computer and information sciences; Alignment-free methods; Burrows–Wheeler transform; Computer science; Computation; Average common substring; 0206 medical engineering; Matching statistics; Scale (descriptive set theory); 02 engineering and technology; Theoretical Computer Science; 03 medical and health sciences; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); Burrows-wheeler transform; String (computer science); Computer Science (all); LCP array; Matching statistic; Data structure; Substring; 030104 developmental biology; Alignment-free methods; Average common substring; Burrows-wheeler transform; Longest common prefix; Matching statistics; Theoretical Computer Science; Computer Science (all); Pairwise comparison; Longest common prefix; Algorithm; 020602 bioinformatics; Alignment-free method
researchProduct

Q-nexus: a comprehensive and efficient analysis pipeline designed for ChIP-nexus

2016

Background: ChIP-nexus, an extension of the ChIP-exo protocol, can be used to map the borders of protein-bound DNA sequences at nucleotide resolution; it requires less input DNA and enables selective PCR duplicate removal using random barcodes. However, the use of random barcodes requires additional preprocessing of the mapping data, which complicates the computational analysis. To date, only a very limited number of software packages are available for the analysis of ChIP-exo data, and these have not yet been systematically tested and compared on ChIP-nexus data. Results: Here, we present a comprehensive software package for ChIP-nexus data that exploits the random barcodes for selective removal …

0301 basic medicine; FOS: Computer and information sciences; Duplication rates; Chromatin Immunoprecipitation; Bioinformatics; Pipeline (computing); 610; Biology; computer.software_genre; 600 Technik Medizin angewandte Wissenschaften::610 Medizin und Gesundheit; 03 medical and health sciences; Software; ChIP-nexus; Genetics; Preprocessor; Nucleotide Motifs; Library complexity; ChIP-exo; Genetics; Protocol (science); Binding Sites; business.industry; fungi; Computational Biology; High-Throughput Nucleotide Sequencing; Reproducibility of Results; Chip; Chromatin immunoprecipitation; Data mapping; DNA-Binding Proteins; Algorithm; 030104 developmental biology; ChIP-exo; Data mining; business; Peak calling; computer; Algorithms; Software; Protein Binding; Transcription Factors; Research Article; Biotechnology; BMC Genomics
researchProduct

Alignment-free sequence comparison using absent words

2018

Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realised by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as q-gram distance, are usually computed in time linear with respect to the length of the sequences. In this paper, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an absent word of some sequence if it does not oc…

0301 basic medicine; FOS: Computer and information sciences; Formal Languages and Automata Theory (cs.FL); Computer Science - Formal Languages and Automata Theory; Sequence alignment; Information System; 0102 computer and information sciences; Circular word; Absent words; 01 natural sciences; Upper and lower bounds; Sequence comparison; Theoretical Computer Science; Combinatorics; 03 medical and health sciences; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); Absent word; Circular words; Mathematics; Sequence; Settore INF/01 - Informatica; Process (computing); q-gram; Computer Science Applications; 1707 Computer Vision and Pattern Recognition; q-grams; Composition (combinatorics); Computer Science Applications; 030104 developmental biology; Computational Theory and Mathematics; Forbidden words; 010201 computation theory & mathematics; Focus (optics); Forbidden word; Word (computer architecture); Information Systems; Integer (computer science)
researchProduct
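
The q-gram distance named in the abstract above can be illustrated with a minimal Python sketch. It assumes the common L1 formulation (the sum, over all length-q substrings, of the absolute difference in occurrence counts); the truncated abstract does not spell out its definition, and the function names `qgram_profile` and `qgram_distance` are hypothetical.

```python
from collections import Counter


def qgram_profile(s: str, q: int) -> Counter:
    """Count every length-q substring (q-gram) of s."""
    return Counter(s[i:i + q] for i in range(len(s) - q + 1))


def qgram_distance(x: str, y: str, q: int) -> int:
    """L1 distance between the q-gram profiles of x and y (assumed definition)."""
    px, py = qgram_profile(x, q), qgram_profile(y, q)
    # A Counter returns 0 for q-grams that are missing from one profile.
    return sum(abs(px[g] - py[g]) for g in px.keys() | py.keys())


print(qgram_distance("abab", "baba", 2))  # prints 2
```

Both profiles are built in a single pass over each sequence, which matches the abstract's remark that such composition-based measures are usually computed in time roughly linear in the sequence length.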

Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes

2020

In clinical trials, animal and cell line models are often used to evaluate the potential toxic effects of a novel compound or candidate drug before progressing to human trials. However, relating the results of animal and in vitro model exposures to relevant clinical outcomes in the human in vivo system still proves challenging, relying on often putative orthologs. In recent years, multiple studies have demonstrated that the repeated dose rodent bioassay, the current gold standard in the field, lacks sufficient sensitivity and specificity in predicting toxic effects of pharmaceuticals in humans. In this study, we evaluate the potential of deep learning techniques to translate the pattern of …

0301 basic medicine; Gene Expression; Gene Expression Regulation/drug effects; Pathology and Laboratory Medicine; Convolutional neural network; TOXICITY; Machine Learning; Voeding Metabolisme en Genomica; Time Measurement; 0302 clinical medicine; Gene expression; Medicine and Health Sciences; Measurement; Clinical Trials as Topic; Multidisciplinary; Artificial neural network; Pharmaceutics; Q; R; Metabolism and Genomics; TOXICOGENOMICS; 030220 oncology & carcinogenesis; Metabolisme en Genomica; Medicine; Engineering and Technology; Nutrition Metabolism and Genomics; Hepatocytes/drug effects; Algorithms; Research Article; Computer and Information Sciences; Clinical Trials as Topic/statistics & numerical data; Neural Networks; Genetic Toxicology; TOXICOLOGY; Science; Predictive Toxicology; Computational biology; Biology; Computer; 03 medical and health sciences; Dose Prediction Methods; Deep Learning; Voeding; Artificial Intelligence; In vivo; Genetics; Life Science; Animals; Humans; Gene; Nutrition; business.industry; Deep learning; Biology and Life Sciences; Gold standard (test); REPRESENTATIONS; Rats; 030104 developmental biology; Gene Expression Regulation; Hepatocytes; Artificial intelligence; Neural Networks Computer; Toxicogenomics; business; Neuroscience
researchProduct

Measuring the clustering effect of BWT via RLE

2017

The Burrows–Wheeler Transform (BWT) is a reversible transformation on which several text compressors and many other tools used in Bioinformatics and Computational Biology are based. The BWT is not actually a compressor, but a transformation that performs a context-dependent permutation of the letters of the input text, which often creates runs of equal letters (clusters) longer than the ones in the original text; this is usually referred to as the “clustering effect” of BWT. In particular, from a combinatorial point of view, great attention has been given to the case in which the BWT produces the fewest clusters (cf. [5], [16], [21], [23]). In this paper we are concerned about t…

0301 basic medicine; General Computer Science; Permutation; Computer Science (all); Binary number; 0102 computer and information sciences; Quantitative Biology::Genomics; 01 natural sciences; Upper and lower bounds; Theoretical Computer Science; Combinatorics; 03 medical and health sciences; Permutation; 030104 developmental biology; Transformation (function); BWT; 010201 computation theory & mathematics; Run-length encoding; Computer Science::Data Structures and Algorithms; Cluster analysis; Primitive root modulo n; BWT; Permutation; Run-length encoding; Theoretical Computer Science; Computer Science (all); Word (computer architecture); Run-length encoding; Mathematics
researchProduct
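
As a rough illustration of the clustering effect described in the abstract above, the following Python sketch computes the BWT of a word by sorting its rotations and counts runs of equal letters before and after the transform. It is a quadratic, textbook-style construction chosen for brevity, not the method of the paper; the names `bwt` and `run_count` are hypothetical.

```python
from itertools import groupby


def bwt(word: str) -> str:
    """Last column of the sorted rotation matrix (for illustration only)."""
    rotations = sorted(word[i:] + word[:i] for i in range(len(word)))
    return "".join(rotation[-1] for rotation in rotations)


def run_count(s: str) -> int:
    """Number of maximal runs of equal letters, i.e. the length of the RLE of s."""
    return sum(1 for _ in groupby(s))


word = "banana"
transformed = bwt(word)
print(transformed, run_count(word), run_count(transformed))  # prints: nnbaaa 6 3
```

Here the six single-letter runs of "banana" collapse to three runs in "nnbaaa", which is the kind of reduction that run-length encoding can exploit.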

Coupling News Sentiment with Web Browsing Data Improves Prediction of Intra-Day Price Dynamics

2015

The new digital revolution of big data is deeply changing our capability of understanding society and forecasting the outcome of many social and economic systems. Unfortunately, information can be very heterogeneous in the importance, relevance, and surprise it conveys, severely affecting the predictive power of semantic and statistical methods. Here we show that the aggregation of web users' behavior can be elicited to overcome this problem in a hard-to-predict complex system, namely the financial market. Specifically, our in-sample analysis shows that the combined use of sentiment analysis of news and browsing activity of users of Yahoo! Finance greatly helps in forecasting intra-day and dai…

0301 basic medicine; INFORMATION; Economics; Computer science; Big data; lcsh:Medicine; Social Sciences; Quantitative Finance - Computational Finance; social and economic systems; Mathematical and Statistical Techniques; Sociology; big data; Econometrics; 050207 economics; Computer Networks; Capital Markets; lcsh:Science; Financial Markets; media_common; 050208 finance; Multidisciplinary; 05 social sciences; Commerce; Social Communication; Settore FIS/02 - Fisica Teorica Modelli e Metodi Matematici; Surprise; Models Economic; Social Networks; Physical Sciences; Social Systems; Engineering and Technology; Computational sociology; BEHAVIOR; Statistics (Mathematics); Network Analysis; Research Article; Computer and Information Sciences; Exploit; media_common.quotation_subject; Twitter; Computational Finance (q-fin.CP); Research and Analysis Methods; FOS: Economics and business; 03 medical and health sciences; SEARCH; 0502 economics and business; Humans; Relevance (information retrieval); Web navigation; Investments; Statistical Methods; Internet; Statistical Finance (q-fin.ST); STOCK-MARKET; business.industry; lcsh:R; Sentiment analysis; Financial market; ATTENTION; Quantitative Finance - Statistical Finance; Communications; Noise Reduction; Financial Firms; 030104 developmental biology; Signal Processing; Predictive power; lcsh:Q; Stock market; business; Social Media; Finance; Mathematics; Forecasting; PLOS ONE
researchProduct

Linear-time sequence comparison using minimal absent words & applications

2016

Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realized by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as q-gram distance, are usually computed in time linear with respect to the length of the sequences. In this article, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an absent word of some sequence if it does not occur in…

0301 basic medicine; Latin Americans; Computer Science (all); Library science; 0102 computer and information sciences; Circular word; Algorithms on string; 01 natural sciences; Alignment-free comparison; Sequence comparison; Theoretical Computer Science; 03 medical and health sciences; 030104 developmental biology; 010201 computation theory & mathematics; Informatics; Political science; Absent word; Forbidden word
researchProduct
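
To make the notion of a minimal absent word from the title above concrete, here is a brute-force Python sketch. It assumes the standard definition, which is not visible in the truncated abstract: a word is a minimal absent word of x if it does not occur in x but all of its proper factors do. The function name `minimal_absent_words` is hypothetical, and this enumeration is far from the linear-time computation the title refers to.

```python
def minimal_absent_words(x: str, alphabet: str) -> set:
    """Brute-force enumeration of the minimal absent words of x (illustration only)."""
    # All factors (substrings) of x, including the empty word.
    factors = {""} | {
        x[i:j] for i in range(len(x)) for j in range(i + 1, len(x) + 1)
    }
    maws = set()
    for prefix in factors:
        for letter in alphabet:
            w = prefix + letter
            # w is absent; its longest proper prefix occurs by construction,
            # so it only remains to check its longest proper suffix.
            if w not in factors and w[1:] in factors:
                maws.add(w)
    return maws


print(sorted(minimal_absent_words("abaab", "ab")))  # prints ['aaa', 'aaba', 'bab', 'bb']
```

Checking only the longest proper prefix and suffix suffices, because every proper factor of a word is a factor of one of the two.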

Dynamic longitudinal behavior in animals exposed to chronic social defeat stress

2020

Chronic social defeat (CSD) can lead to impairments in social interaction and other behaviors that are supposed to model features of major depressive disorder (MDD). Not all animals subjected to CSD, however, develop these impairments, and maintained social interaction in some animals is widely used as a model for resilience to stress-induced mental dysfunctions. So far, animals have mainly been studied shortly (24 hours and 7 days) after CSD exposure and longitudinal development of behavioral phenotypes in individual animals has been mostly neglected. We have analyzed social interaction and novel object recognition behavior of stressed mice at different time points after CSD and ha…

0301 basic medicine; Male; Behavioral phenotypes; Time Factors; Social Sciences; Social defeat; Mice; 0302 clinical medicine; Cognition; Learning and Memory; Stress (linguistics); Psychology; Longitudinal Studies; media_common; Mammals; Multidisciplinary; Animal Behavior; Behavior Animal; Q; R; Eukaryota; Resilience Psychological; Longitudinal development; Aggression; Animal Sociality; Vertebrates; Medicine; Major depressive disorder; Psychological resilience; Disease Susceptibility; Psychology; Behavior Observation Techniques; Network Analysis; Clinical psychology; Research Article; Computer and Information Sciences; Science; media_common.quotation_subject; Rodents; Network Resilience; 03 medical and health sciences; Memory; medicine; Animals; Humans; Interpersonal Relations; Novel object recognition; Behavior; Depressive Disorder Major; Network resilience; Visual object recognition; Animal performance; Behavior; Animal sociality; Collective animal behavior; Animal behavior; Mice; Organisms; Cognitive Psychology; Biology and Life Sciences; Collective Animal Behavior; medicine.disease; Social relation; Disease Models Animal; 030104 developmental biology; Collective Human Behavior; Amniotes; Chronic Disease; Cognitive Science; Perception; Collective animal behavior; Visual Object Recognition; Zoology; 030217 neurology & neurosurgery; Stress Psychological; Neuroscience; PLoS ONE
researchProduct