6533b86cfe1ef96bd12c80f5

RESEARCH PRODUCT

pcaExplorer: an R/Bioconductor package for interacting with RNA-seq principal components

Harald BinderFederico Marini

subject

Computer scienceInterface (computing)ShinyBioconductorPrincipal component analysis610 MedizinRNA-SeqGenomicslcsh:Computer applications to medicine. Medical informaticsReproducible researchBioconductorTranscriptomeExploratory data analysisUser-friendly610 Medical sciencesGene expressionHumansRNA-SeqGenelcsh:QH301-705.5Data CurationBase Sequencebusiness.industrySequence Analysis RNARRNAReproducibility of Resultslcsh:Biology (General)Principal component analysisRNAlcsh:R858-859.7Software engineeringbusinessSoftware

description

AbstractBackgroundPrincipal component analysis (PCA) is frequently useentirely written ind in genomics applications for quality assessment and exploratory analysis in high-dimensional data, such as RNA sequencing (RNA-seq) gene expression assays. Despite the availability of many software packages developed for this purpose, an interactive and comprehensive interface for performing these operations is lacking.ResultsWe developed the pcaExplorer software package to enhance commonly performed analysis steps with an interactive and user-friendly application, which provides state saving as well as the automated creation of reproducible reports. pcaExplorer is implemented in R using the Shiny framework and exploits data structures from the open-source Bioconductor project. Users can easily generate a wide variety of publication-ready graphs, while assessing the expression data in the different modules available, including a general overview, dimension reduction on samples and genes, as well as functional interpretation of the principal components.ConclusionpcaExplorer is distributed as an R package in the Bioconductor project (http://bioconductor.org/packages/pcaExplorer/), and is designed to assist a broad range of researchers in the critical step of interactive data exploration.

https://dx.doi.org/10.25358/openscience-219