Search results for "throughput"

showing 10 items of 483 documents

unitas: the universal tool for annotation of small RNAs

2017

AbstractBackgroundNext generation sequencing is a key technique in small RNA biology research that has led to the discovery of functionally different classes of small non-coding RNAs in the past years. However, reliable annotation of the extensive amounts of small non-coding RNA data produced by high-throughput sequencing is time-consuming and requires robust bioinformatics expertise. Moreover, existing tools have a number of shortcomings including a lack of sensitivity under certain conditions, limited number of supported species or detectable sub-classes of small RNAs.ResultsHere we introduce unitas, an out-of-the-box ready software for complete annotation of small RNA sequence datasets, …

Small RNAtRNA-derived fragments (tRFs)Computational biologypiRNABiologyDNA sequencing570 Life sciencesAnnotationEnsemblHumansRNA-seq data analysismiRNAGeneticsbusiness.industryphasiRNARNAHigh-Throughput Nucleotide SequencingUsabilityMolecular Sequence AnnotationNon-coding RNAKey (cryptography)RNA Small UntranslatedSmall non-coding RNAsbusinessSoftwareHeLa Cells570 Biowissenschaften
researchProduct

On the statistical properties of the capacity of amplify-and-forward channels under LOS conditions

2008

Amplify-and-forward channels in cooperative networks provide a promising improvement in the network coverage and system throughput. Under line-of-sight (LOS) propagation conditions in such cooperative networks, the overall fading channel can be modeled by a double Rice process. In this article, we have studied the statistical properties of the capacity of double Rice fading channels. We have derived the analytical expressions for the probability density function (PDF), cumulative distribution function (CDF), level-crossing rate (LCR), and average duration of fades (ADF) of the channel capacity. The obtained results are studied for different values of the amplitudes of the LOS components in …

Spatial correlationbusiness.industryCumulative distribution functionProbability density functionThroughputTopologyCommunications systemChannel capacityFading distributionFadingTelecommunicationsbusinessComputer Science::Information TheoryMathematics2008 11th IEEE Singapore International Conference on Communication Systems
researchProduct

Discovery and structure-activity relationship studies of irreversible benzisothiazolinone-based inhibitors against Staphylococcus aureus sortase A tr…

2014

Gram-positive bacteria, in general, and staphylococci, in particular, are the widespread cause of nosocomial and community-acquired infections. The rapid evolvement of strains resistant to antibiotics currently in use is a serious challenge. Novel antimicrobial compounds have to be developed to fight these resistant bacteria, and sortase A, a bacterial cell wall enzyme, is a promising target for novel therapies. As a transpeptidase that covalently attaches various virulence factors to the cell surface, this enzyme plays a crucial role in the ability of bacteria to invade the host's tissues and to escape the immune response. In this study we have screened a small molecule library against rec…

Staphylococcus aureusClinical BiochemistryPharmaceutical ScienceVirulenceStaphylococcal infectionsmedicine.disease_causeBiochemistryBacterial cell structureMicrobiologyStructure-Activity RelationshipBacterial ProteinsSortaseDrug DiscoverymedicineFluorescence Resonance Energy TransferHumansEnzyme InhibitorsMolecular BiologybiologyChemistryOrganic ChemistryStaphylococcal InfectionsAntimicrobialmedicine.diseasebiology.organism_classificationAminoacyltransferasesHigh-Throughput Screening AssaysMolecular Docking SimulationCysteine EndopeptidasesThiazolesBiochemistryStaphylococcus aureusSortase AMolecular MedicineBacteriaBioorganicmedicinal chemistry
researchProduct

CROSSMAPPER: estimating cross-mapping rates and optimizing experimental design in multi-species sequencing studies

2020

Motivation Numerous sequencing studies, including transcriptomics of host-pathogen systems, sequencing of hybrid genomes, xenografts, mixed species systems, metagenomics and meta-transcriptomics, involve samples containing genetic material from divergent organisms. A crucial step in these studies is identifying from which organism each sequencing read originated, and the experimental design should be directed to minimize biases caused by cross-mapping of reads to incorrect source genomes. Additionally, pooling of sufficiently different genetic material into a single sequencing library could significantly reduce experimental costs but requires careful planning and assessment of the impact of…

Statistics and Probability:Informàtica::Aplicacions de la informàtica::Bioinformàtica [Àrees temàtiques de la UPC]Computer sciencecomputer.software_genreBiochemistryGenomeTranscriptome03 medical and health sciencesResource (project management)GenomesTranscriptomicsMolecular BiologyOrganismGenòmica -- Informàtica030304 developmental biology0303 health sciences030306 microbiologyHigh-Throughput Nucleotide SequencingGenomicsSequence Analysis DNADNAGenome analysisGenome AnalysisAnàlisis de seqüènciesComputer Science ApplicationsApplications NoteComputational MathematicsComputational Theory and MathematicsCross-mappingResearch DesignMetagenomicsRNAData miningLine (text file)computerSoftwareGenèticaparametres
researchProduct

Cluster-Localized Sparse Logistic Regression for SNP Data

2012

The task of analyzing high-dimensional single nucleotide polymorphism (SNP) data in a case-control design using multivariable techniques has only recently been tackled. While many available approaches investigate only main effects in a high-dimensional setting, we propose a more flexible technique, cluster-localized regression (CLR), based on localized logistic regression models, that allows different SNPs to have an effect for different groups of individuals. Separate multivariable regression models are fitted for the different groups of individuals by incorporating weights into componentwise boosting, which provides simultaneous variable selection, hence sparse fits. For model fitting, th…

Statistics and ProbabilityBoosting (machine learning)Computer scienceMultivariable calculusComputational BiologyHigh-Throughput Nucleotide SequencingFeature selectionRegression analysisModels TheoreticalLogistic regressioncomputer.software_genrePolymorphism Single NucleotideRegressionComputational MathematicsLogistic ModelsData Interpretation StatisticalGeneticsCluster AnalysisHumansData miningCluster analysisMolecular BiologyUnit-weighted regressioncomputerGenome-Wide Association StudyStatistical Applications in Genetics and Molecular Biology
researchProduct

Pathway analysis of high-throughput biological data within a Bayesian network framework

2011

Abstract Motivation: Most current approaches to high-throughput biological data (HTBD) analysis either perform individual gene/protein analysis or, gene/protein set enrichment analysis for a list of biologically relevant molecules. Bayesian Networks (BNs) capture linear and non-linear interactions, handle stochastic events accounting for noise, and focus on local interactions, which can be related to causal inference. Here, we describe for the first time an algorithm that models biological pathways as BNs and identifies pathways that best explain given HTBD by scoring fitness of each network. Results: Proposed method takes into account the connectivity and relatedness between nodes of the p…

Statistics and ProbabilityComputer scienceHigh-throughput screeningGene regulatory networkcomputer.software_genreModels BiologicalBiochemistrySynthetic dataBiological pathwayBayes' theoremHumansGene Regulatory NetworksCarcinoma Renal CellMolecular BiologyGeneBiological dataMicroarray analysis techniquesGene Expression ProfilingBayesian networkRobustness (evolution)Bayes TheoremPathway analysisKidney NeoplasmsHigh-Throughput Screening AssaysComputer Science ApplicationsGene expression profilingComputational MathematicsComputational Theory and MathematicsCausal inferenceData miningcomputerAlgorithmsSoftwareBioinformatics
researchProduct

Acceleration of short and long DNA read mapping without loss of accuracy using suffix array

2014

HPG Aligner applies suffix arrays for DNA read mapping. This implementation produces a highly sensitive and extremely fast mapping of DNA reads that scales up almost linearly with read length. The approach presented here is faster (over 20 for long reads) and more sensitive (over 98% in a wide range of read lengths) than the current state-of-the-art mappers. HPG Aligner is not only an optimal alternative for current sequencers but also the only solution available to cope with longer reads and growing throughputs produced by forthcoming sequencing technologies.

Statistics and ProbabilityComputer scienceSequence analysisSequence alignmentdatabase searchescomputer.software_genreBiochemistrylaw.inventionAccelerationchemistry.chemical_compoundlawCIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIALAnimalsHumansMolecular BiologyDatabasesequencing dataSuffix arraySequence analysisHigh-Throughput Nucleotide SequencingalignmentSequence Analysis DNAApplications NotesComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicschemistryDrosophilaSuffixSequence AlignmentcomputerAlgorithmAlgorithmsSoftwareDNA
researchProduct

MLML2R: an R package for maximum likelihood estimation of DNA methylation and hydroxymethylation proportions.

2019

Abstract Accurately measuring epigenetic marks such as 5-methylcytosine (5-mC) and 5-hydroxymethylcytosine (5-hmC) at the single-nucleotide level, requires combining data from DNA processing methods including traditional (BS), oxidative (oxBS) or Tet-Assisted (TAB) bisulfite conversion. We introduce the R package MLML2R, which provides maximum likelihood estimates (MLE) of 5-mC and 5-hmC proportions. While all other available R packages provide 5-mC and 5-hmC MLEs only for the oxBS+BS combination, MLML2R also provides MLE for TAB combinations. For combinations of any two of the methods, we derived the pool-adjacent-violators algorithm (PAVA) exact constrained MLE in analytical form. For the…

Statistics and ProbabilityDNA HydroxymethylationEpigenomicsIterative methodMaximum likelihood03 medical and health sciencessymbols.namesake0302 clinical medicineGeneticsHumansMolecular Biology030304 developmental biologyMathematics0303 health sciencesLikelihood FunctionsComputational BiologyHigh-Throughput Nucleotide SequencingProbability and statisticsDNA MethylationComputational MathematicsR packageLagrange multiplierDNA methylationsymbolsIterative approximationAlgorithm030217 neurology & neurosurgeryStatistical applications in genetics and molecular biology
researchProduct

Galaxy LIMS for next-generation sequencing.

2013

Abstract Summary: We have developed a laboratory information management system (LIMS) for a next-generation sequencing (NGS) laboratory within the existing Galaxy platform. The system provides lab technicians standard and customizable sample information forms, barcoded submission forms, tracking of input sample quality, multiplex-capable automatic flow cell design and automatically generated sample sheets to aid physical flow cell preparation. In addition, the platform provides the researcher with a user-friendly interface to create a request, submit accompanying samples, upload sample quality measurements and access to the sequencing results. As the LIMS is within the Galaxy platform, the …

Statistics and ProbabilityDatabasebusiness.industryComputer scienceSample (material)Interface (computing)High-Throughput Nucleotide Sequencingcomputer.software_genreBiochemistryDNA sequencingComputer Science ApplicationsWorkflowWorld Wide WebComputational MathematicsUser-Computer InterfaceSoftwareComputational Theory and MathematicsbusinessMolecular BiologycomputerSoftwareInformation SystemsBioinformatics (Oxford, England)
researchProduct

Adaptive reference-free compression of sequence quality scores

2014

Motivation: Rapid technological progress in DNA sequencing has stimulated interest in compressing the vast datasets that are now routinely produced. Relatively little attention has been paid to compressing the quality scores that are assigned to each sequence, even though these scores may be harder to compress than the sequences themselves. By aggregating a set of reads into a compressed index, we find that the majority of bases can be predicted from the sequence of bases that are adjacent to them and hence are likely to be less informative for variant calling or other applications. The quality scores for such bases are aggressively compressed, leaving a relatively small number at full reso…

Statistics and ProbabilityFOS: Computer and information sciencesComputer sciencemedia_common.quotation_subjectReference-freecomputer.software_genreBiochemistryDNA sequencingSet (abstract data type)Redundancy (information theory)BWTComputer Science - Data Structures and AlgorithmsCode (cryptography)AnimalsHumansQuality (business)Data Structures and Algorithms (cs.DS)Quantitative Biology - GenomicsCaenorhabditis elegansMolecular Biologymedia_commonGenomics (q-bio.GN)SequenceGenomeSettore INF/01 - Informaticareference-free compressionHigh-Throughput Nucleotide SequencingGenomicsSequence Analysis DNAData CompressioncompressionComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsFOS: Biological sciencesData miningquality scoreMetagenomicscomputerBWT; compression; quality score; reference-free compressionAlgorithmsReference genome
researchProduct