Search results for "Applications"
showing 10 items of 4541 documents
kmcEx: memory-frugal and retrieval-efficient encoding of counted k-mers.
2018
Abstract Motivation K-mers along with their frequency have served as an elementary building block for error correction, repeat detection, multiple sequence alignment, genome assembly, etc., attracting intensive studies in k-mer counting. However, the output of k-mer counters itself is large; very often, it is too large to fit into main memory, leading to highly narrowed usability. Results We introduce a novel idea of encoding k-mers as well as their frequency, achieving good memory saving and retrieval efficiency. Specifically, we propose a Bloom filter-like data structure to encode counted k-mers by coupled-bit arrays—one for k-mer representation and the other for frequency encoding. Exper…
ArtiFuse—computational validation of fusion gene detection tools without relying on simulated reads
2019
Abstract Motivation Gene fusions are an important class of transcriptional variants that can influence cancer development and can be predicted from RNA sequencing (RNA-seq) data by multiple existing tools. However, the real-world performance of these tools is unclear due to the lack of known positive and negative events, especially with regard to fusion genes in individual samples. Often simulated reads are used, but these cannot account for all technical biases in RNA-seq data generated from real samples. Results Here, we present ArtiFuse, a novel approach that simulates fusion genes by sequence modification to the genomic reference, and therefore, can be applied to any RNA-seq dataset wit…
NeoFox: annotating neoantigen candidates with neoantigen features
2020
Abstract Summary The detection and prediction of true neoantigens is of great importance for the field of cancer immunotherapy. We searched the literature for proposed neoantigen features and integrated them into a toolbox called NEOantigen Feature toolbOX (NeoFox). NeoFox is an easy-to-use Python package that enables the annotation of neoantigen candidates with 16 neoantigen features. Availability and implementation NeoFox is freely available as an open source Python package released under the GNU General Public License (GPL) v3 at https://github.com/TRON-Bioinformatics/neofox. Supplementary information Supplementary data are available at Bioinformatics online.
IntelliPy: a GUI for analyzing IntelliCage data
2021
Abstract Summary The IntelliCage systems offer the possibility to conduct long-term behavioral experiments on mice in social groups without human intervention. Although this setup provides new findings, only about 150 studies with the IntelliCage system have been published in the last two decades, which is also caused by the challenging problems of processing and handling the large and heterogeneous amounts of captured data. This application note introduces the Python-GUI IntelliPy, especially designed for users not very experienced in using programming languages. IntelliPy allows users to quickly analyze the IntelliCage output in a user-friendly way, thus making the systems more accessible…
LipiDisease: associate lipids to diseases using literature mining
2021
Abstract Summary Lipids play an essential role in cellular assembly and signaling. Dysregulation of these functions has been linked with many complications including obesity, diabetes, metabolic disorders, cancer and more. Investigating lipid profiles in such conditions can provide insights into cellular functions and possible interventions. Hence the field of lipidomics has been expanding in recent years. Even though the role of individual lipids in diseases has been investigated, there is no resource to perform disease enrichment analysis considering the cumulative association of a lipid set. To address this, we have implemented the LipiDisease web server. The tool analyzes millions of recor…
RNA-Seq Atlas—a reference database for gene expression profiling in normal tissue by next-generation sequencing
2012
Abstract Motivation: Next-generation sequencing technology enables an entirely new perspective for clinical research and will speed up personalized medicine. In contrast to microarray-based approaches, RNA-Seq analysis provides a much more comprehensive and unbiased view of gene expression. Although the perspective is clear and the long-term success of this new technology obvious, bioinformatics resources making these data easily available especially to the biomedical research community are still evolving. Results: We have generated RNA-Seq Atlas, a web-based repository of RNA-Seq gene expression profiles and query tools. The website offers open and easy access to RNA-Seq gene expression pr…
Towards next-generation diagnostics for tuberculosis: identification of novel molecular targets by large-scale comparative genomics.
2020
5 pages, 2 figures. AVAILABILITY AND IMPLEMENTATION: The database of non-tuberculous mycobacteria assemblies can be accessed at: 10.5281/zenodo.3374377. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online: http://dx.doi.org/10.1093/bioinformatics/btz729
RabbitMash: accelerating hash-based genome analysis on modern multi-core architectures
2020
Abstract Motivation Mash is a popular hash-based genome analysis toolkit with applications to important downstream analysis tasks such as clustering and assembly. However, Mash is currently not able to fully exploit the capabilities of modern multi-core architectures, which in turn leads to high runtimes for large-scale genomic datasets. Results We present RabbitMash, an efficient, highly optimized implementation of Mash which can take full advantage of modern hardware including multi-threading, vectorization and fast I/O. We show that our approach achieves speedups of at least 1.3, 9.8, 8.5 and 4.4 compared to Mash for the operations sketch, dist, triangle and screen, respectively. Furtherm…
Robust Mean Field Games
2015
Recently there has been renewed interest in large-scale games in several research disciplines, with diverse application domains as in the smart grid, cloud computing, financial markets, biochemical reaction networks, transportation science, and molecular biology. Prior works have provided rich mathematical foundations and equilibrium concepts but relatively little in terms of robustness in the presence of uncertainties. In this paper, we study mean field games with uncertainty in both states and payoffs. We consider a population of players with individual states driven by a standard Brownian motion and a disturbance term. The contribution is threefold: First, we establish a mean field syste…
Bioclimatic atlas of the terrestrial Arctic
2023
Abstract The Arctic is the region on Earth that is warming at the fastest rate. In addition to rising means of temperature-related variables, Arctic ecosystems are affected by increasingly frequent extreme weather events causing disturbance to Arctic ecosystems. Here, we introduce a new dataset of bioclimatic indices relevant for investigating the changes of Arctic terrestrial ecosystems. The dataset, called ARCLIM, consists of several climate and event-type indices for the northern high-latitude land areas > 45°N. The indices are calculated from the hourly ERA5-Land reanalysis data for 1950–2021 in a spatial grid of 0.1 degree (~9 km) resolution. The indices are provided in three subsets…