Search results for " Applications"

showing 10 items of 4541 documents

kmcEx: memory-frugal and retrieval-efficient encoding of counted k-mers.

2018

Abstract Motivation K-mers along with their frequency have served as an elementary building block for error correction, repeat detection, multiple sequence alignment, genome assembly, etc., attracting intensive studies in k-mer counting. However, the output of k-mer counters itself is large; very often, it is too large to fit into main memory, leading to highly narrowed usability. Results We introduce a novel idea of encoding k-mers as well as their frequency, achieving good memory saving and retrieval efficiency. Specifically, we propose a Bloom filter-like data structure to encode counted k-mers by coupled-bit arrays—one for k-mer representation and the other for frequency encoding. Exper…

Statistics and ProbabilitySource codeComputer sciencemedia_common.quotation_subject0206 medical engineeringHash function02 engineering and technologyBiochemistry03 medical and health sciencesEncoding (memory)Molecular BiologyTime complexity030304 developmental biologyBlock (data storage)media_common0303 health sciencesSequence Analysis DNAData structureComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsError detection and correctionAlgorithmSequence Alignment020602 bioinformaticsAlgorithmsSoftwareBioinformatics (Oxford, England)
researchProduct

ArtiFuse—computational validation of fusion gene detection tools without relying on simulated reads

2019

Abstract Motivation Gene fusions are an important class of transcriptional variants that can influence cancer development and can be predicted from RNA sequencing (RNA-seq) data by multiple existing tools. However, the real-world performance of these tools is unclear due to the lack of known positive and negative events, especially with regard to fusion genes in individual samples. Often simulated reads are used, but these cannot account for all technical biases in RNA-seq data generated from real samples. Results Here, we present ArtiFuse, a novel approach that simulates fusion genes by sequence modification to the genomic reference, and therefore, can be applied to any RNA-seq dataset wit…

Statistics and ProbabilitySource codeSequence analysisComputer sciencemedia_common.quotation_subjectValue (computer science)Genomicscomputer.software_genreBiochemistryFusion gene03 medical and health sciences0302 clinical medicineSoftwareMolecular BiologyGene030304 developmental biologymedia_common0303 health sciencesSequence Analysis RNAbusiness.industryHigh-Throughput Nucleotide SequencingRNAGenomicsComputer Science ApplicationsComputational MathematicsComputational Theory and Mathematics030220 oncology & carcinogenesisBenchmark (computing)RNAData miningGene FusionbusinesscomputerSoftwareBioinformatics
researchProduct

NeoFox: annotating neoantigen candidates with neoantigen features

2020

Abstract Summary The detection and prediction of true neoantigens is of great importance for the field of cancer immunotherapy. Wesearched the literature for proposed neoantigen features and integrated them into a toolbox called NEOantigen Feature toolbOX (NeoFox). NeoFox is an easy-to-use Python package that enables the annotation of neoantigen candidates with 16 neoantigen features. Availability and implementation NeoFox is freely available as an open source Python package released under the GNU General Public License (GPL) v3 license at https://github.com/TRON-Bioinformatics/neofox. Supplementary information Supplementary data are available at Bioinformatics online.

Statistics and ProbabilitySupplementary data0303 health sciencesInformation retrievalComputer science030302 biochemistry & molecular biologyPython (programming language)BiochemistryToolbox3. Good healthComputer Science Applications03 medical and health sciencesComputational MathematicsAnnotationOpen sourceComputational Theory and MathematicsMolecular Biologycomputer030304 developmental biologycomputer.programming_languageBioinformatics
researchProduct

IntelliPy: a GUI for analyzing IntelliCage data

2021

Abstract Summary The IntelliCage systems offer the possibility to conduct long-term behavioral experiments on mice in social groups without human intervention. Although this setup provides new findings, only about 150 studies with the IntelliCage system have been published in the last two decades, which is also caused by the challenging problems of processing and handling the large and heterogeneous amounts of captured data. This application note introduces the Python-GUI IntelliPy, especially designed for users not very experienced in using programming languages. IntelliPy allows users to quickly analyze the IntelliCage output in a user-friendly way, thus making the systems more accessible…

Statistics and ProbabilitySupplementary dataAcademicSubjects/SCI01060Computer scienceSystems BiologyMEDLINEBiochemistryApplications NotesComputer Science ApplicationsSocial groupWorld Wide WebComputational MathematicsComputational Theory and MathematicsIntervention (counseling)Molecular BiologyBioinformatics
researchProduct

LipiDisease: associate lipids to diseases using literature mining

2021

Abstract Summary Lipids exhibit an essential role in cellular assembly and signaling. Dysregulation of these functions has been linked with many complications including obesity, diabetes, metabolic disorders, cancer and more. Investigating lipid profiles in such conditions can provide insights into cellular functions and possible interventions. Hence the field of lipidomics is expanding in recent years. Even though the role of individual lipids in diseases has been investigated, there is no resource to perform disease enrichment analysis considering the cumulative association of a lipid set. To address this, we have implemented the LipiDisease web server. The tool analyzes millions of recor…

Statistics and ProbabilitySupplementary dataWeb serverAcademicSubjects/SCI01060Computer scienceCellular functionsComputational biologyDiseasecomputer.software_genreApplications NotesBiochemistryField (computer science)Computer Science ApplicationsComputational MathematicsComputational Theory and MathematicsLipidomicsData and Text MiningMolecular BiologycomputerBioinformatics
researchProduct

RNA-Seq Atlas—a reference database for gene expression profiling in normal tissue by next-generation sequencing

2012

Abstract Motivation: Next-generation sequencing technology enables an entirely new perspective for clinical research and will speed up personalized medicine. In contrast to microarray-based approaches, RNA-Seq analysis provides a much more comprehensive and unbiased view of gene expression. Although the perspective is clear and the long-term success of this new technology obvious, bioinformatics resources making these data easily available especially to the biomedical research community are still evolving. Results: We have generated RNA-Seq Atlas, a web-based repository of RNA-Seq gene expression profiles and query tools. The website offers open and easy access to RNA-Seq gene expression pr…

Statistics and ProbabilitySystems biologyRNA-SeqComputational biologyBiologycomputer.software_genreBiochemistryNeoplasmsGene expressionHumansMicroarray databasesMolecular BiologyGeneOligonucleotide Array Sequence AnalysisInternetSequence Analysis RNAbusiness.industryGene Expression ProfilingHigh-Throughput Nucleotide SequencingComputer Science ApplicationsGene expression profilingComputational MathematicsComputational Theory and MathematicsGene chip analysisData miningPersonalized medicineDatabases Nucleic AcidbusinesscomputerSoftwareBioinformatics
researchProduct

Towards next-generation diagnostics for tuberculosis: identification of novel molecular targets by large-scale comparative genomics.

2020

5 páginas, 2 figuras. AVAILABILITY AND IMPLEMENTATION: The database of non-tuberculous mycobacteria assemblies can be accessed at: 10.5281/zenodo.3374377. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online: http://dx.doi.org/10.1093/bioinformatics/btz729

Statistics and ProbabilityTuberculosisGenomicsComputational biologyBiologyBiochemistryMycobacterium tuberculosis03 medical and health sciencesmedicineHumansTuberculosisDiscovery NotesMolecular Biology030304 developmental biologyComparative genomics0303 health sciences030306 microbiologyScale (chemistry)GenomicsMycobacterium tuberculosismedicine.diseasebiology.organism_classificationGenome Analysis3. Good healthComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsMycobacterium tuberculosis complexMolecular targetsIdentification (biology)BiomarkersBioinformatics (Oxford, England)
researchProduct

RabbitMash: accelerating hash-based genome analysis on modern multi-core architectures

2020

Abstract Motivation Mash is a popular hash-based genome analysis toolkit with applications to important downstream analyses tasks such as clustering and assembly. However, Mash is currently not able to fully exploit the capabilities of modern multi-core architectures, which in turn leads to high runtimes for large-scale genomic datasets. Results We present RabbitMash, an efficient highly optimized implementation of Mash which can take full advantage of modern hardware including multi-threading, vectorization and fast I/O. We show that our approach achieves speedups of at least 1.3, 9.8, 8.5 and 4.4 compared to Mash for the operations sketch, dist, triangle and screen, respectively. Furtherm…

Statistics and ProbabilityWorkstationExploitComputer scienceHash functionParallel computingBiochemistrylaw.invention03 medical and health sciencesSoftwarelawCluster analysisMolecular Biology030304 developmental biology0303 health sciencesMulti-core processorGenomeComputersbusiness.industry030302 biochemistry & molecular biologyGenomicsSketchComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsbusinessAlgorithmsSoftwareBioinformatics
researchProduct

Robust Mean Field Games

2015

Recently there has been renewed interest in large-scale games in several research disciplines, with diverse application domains as in the smart grid, cloud computing, financial markets, biochemical reaction networks, transportation science, and molecular biology. Prior works have provided rich mathematical foundations and equilibrium concepts but relatively little in terms of robustness in the presence of uncertainties. In this paper, we study mean field games with uncertainty in both states and payoffs. We consider a population of players with individual states driven by a standard Brownian motion and a disturbance term. The contribution is threefold: First, we establish a mean field syste…

Statistics and Probabilitygame theory0209 industrial biotechnologyEconomics and EconometricsMathematical optimizationPopulationCloud computing02 engineering and technology01 natural sciencessymbols.namesake020901 industrial engineering & automationResource (project management)Wiener processSettore ING-INF/04 - AutomaticaRobustness (computer science)0101 mathematicseducationMathematicseducation.field_of_studybusiness.industryApplied Mathematics010102 general mathematicsComputer Graphics and Computer-Aided DesignComputer Science ApplicationsTerm (time)Computational MathematicsSmart gridComputational Theory and MathematicsNash equilibriumsymbolsmean field gamestochastic optimal controlSettore MAT/09 - Ricerca OperativabusinessMathematical economics
researchProduct

Bioclimatic atlas of the terrestrial Arctic

2023

AbstractThe Arctic is the region on Earth that is warming at the fastest rate. In addition to rising means of temperature-related variables, Arctic ecosystems are affected by increasingly frequent extreme weather events causing disturbance to Arctic ecosystems. Here, we introduce a new dataset of bioclimatic indices relevant for investigating the changes of Arctic terrestrial ecosystems. The dataset, called ARCLIM, consists of several climate and event-type indices for the northern high-latitude land areas > 45°N. The indices are calculated from the hourly ERA5-Land reanalysis data for 1950–2021 in a spatial grid of 0.1 degree (~9 km) resolution. The indices are provided in three subsets…

Statistics and Probabilityhiilidioksidiarctic regionmeltingclimate changeswarmingPhysiologyEventsrainfallLibrary and Information SciencesklimatologiaEducationeliömaantiedeSnowilmastoSpecies distribution modelsVariabilityClimate-changeclimate1172 Environmental sciencesbiogeographyarktinen aluetemperaturecarbon dioxidesulaminenclimatologyilmastonmuutoksetecosystems (ecology)ekologiaComputer Science Applicationsekosysteemit (ekologia)sademääräclimate changeImpactsSea-icelämpötilaStatistics Probability and UncertaintyTrendslämpeneminenInformation Systemsclimate-change ecology
researchProduct