Search results for "throughput"

showing 10 items of 483 documents

MetaCache: context-aware classification of metagenomic reads using minhashing.

2017

Abstract. Motivation: Metagenomic shotgun sequencing studies are becoming increasingly popular, with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes, corresponding software tools suffer from long runtimes, large memory requirements, or low accuracy. Results: We introduce MetaCache, a novel software for read classification using the big data technique minhashing. Our…

0301 basic medicine; Statistics and Probability; Computer science; Sequence analysis; Context (language use); Biochemistry; Genome; 03 medical and health sciences; chemistry.chemical_compound; 0302 clinical medicine; RefSeq; Humans; Molecular Biology; Information retrieval; Shotgun sequencing; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; chemistry; Metagenomics; Metagenomics; 030217 neurology & neurosurgery; DNA; Algorithms; Software; Reference genome; Bioinformatics (Oxford, England)

researchProduct

ParDRe: faster parallel duplicated reads removal tool for sequencing studies

2016

This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record is available online at: https://doi.org/10.1093/bioinformatics/btw038 [Abstract] Summary: Current next-generation sequencing technologies often generate duplicated or near-duplicated reads that (depending on the application scenario) do not provide any interesting biological information but can increase memory requirements and computational time of downstream analysis. In this work we present ParDRe, a de novo parallel tool to remove duplicated and near-duplicated reads through the clustering of S…

0301 basic medicine; Statistics and Probability; FASTQ format; DNA strings; Source code; Downstream (software development); Computer science; media_common.quotation_subject; Parallel computing; computer.software_genre; Biochemistry; DNA sequencing; 03 medical and health sciences; chemistry.chemical_compound; 0302 clinical medicine; Hybrid MPI/multithreading; Cluster Analysis; ParDRe; Molecular Biology; Gene; media_common; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; Parallel tool; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; chemistry; Data mining; computer; Algorithms; 030217 neurology & neurosurgery; DNA; Bioinformatics

researchProduct

panISa: ab initio detection of insertion sequences in bacterial genomes from short read sequence data.

2018

Abstract. Motivation: The advent of next-generation sequencing has boosted the analysis of bacterial genome evolution. Insertion sequence (IS) elements play a key role in prokaryotic genome organization and evolution, but their repetitions in genomes complicate their detection from short-read data. Results: PanISa is a software pipeline that identifies IS insertions ab initio in bacterial genomes from short-read data. It is a highly sensitive and precise tool based on the detection of read-mapping patterns at the insertion site. PanISa performs better than existing IS detection systems as it is based on a database-free approach. We applied it to a high-risk clone lineage of the pathogenic spec…

0301 basic medicine; Statistics and Probability; Lineage (genetic); Computer science; Ab initio; Computational biology; Bacterial genome size; [INFO.INFO-SE] Computer Science [cs]/Software Engineering [cs.SE]; Biochemistry; Genome; [INFO.INFO-IU] Computer Science [cs]/Ubiquitous Computing; 03 medical and health sciences; [INFO.INFO-CR] Computer Science [cs]/Cryptography and Security [cs.CR]; [SDV.BBM.GTP] Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]; Insertion sequence; Molecular Biology; Genomic organization; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; [SDV.BIBS] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]; [SDV.MP.BAC] Life Sciences [q-bio]/Microbiology and Parasitology/Bacteriology; Pipeline (software); [INFO.INFO-MO] Computer Science [cs]/Modeling and Simulation; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; [INFO.INFO-MA] Computer Science [cs]/Multiagent Systems [cs.MA]; DNA Transposable Elements; [INFO.INFO-ET] Computer Science [cs]/Emerging Technologies [cs.ET]; [INFO.INFO-DC] Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]; Genome Bacterial; Software; Bioinformatics (Oxford, England)

researchProduct

AFS: identification and quantification of species composition by metagenomic sequencing

2017

Abstract. Summary: DNA-based methods to detect and quantify taxon composition in biological materials are often based on species-specific polymerase chain reaction, limited to detecting species targeted by the assay. Next-generation sequencing overcomes this drawback by untargeted shotgun sequencing of whole metagenomes at affordable cost. Here we present AFS, a software pipeline for quantification of species composition in food. AFS uses metagenomic shotgun sequencing and sequence read counting to infer species proportions. Using Illumina data from a reference sausage comprising four species, we reveal that AFS is independent of the sequencing assay and library preparation protocol. Cost-sav…

0301 basic medicine; Statistics and Probability; Sequence analysis; Library preparation; Computational biology; Biology; Bioinformatics; Biochemistry; law.invention; 03 medical and health sciences; 0404 agricultural biotechnology; law; Molecular Biology; Polymerase chain reaction; Shotgun sequencing; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; 04 agricultural and veterinary sciences; Accession number (bioinformatics); 040401 food science; Biological materials; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Metagenomics; Food Microbiology; Identification (biology); Metagenomics; Software; Bioinformatics

researchProduct

One-Cell Doubling Evaluation by Living Arrays of Yeast, ODELAY!

2016

Abstract. Cell growth is a complex phenotype widely used in systems biology to gauge the impact of genetic and environmental perturbations. Due to the magnitude of genome-wide studies, resolution is often sacrificed in favor of throughput, creating a demand for scalable, time-resolved, quantitative methods of growth assessment. We present ODELAY (One-cell Doubling Evaluation by Living Arrays of Yeast), an automated and scalable growth analysis platform. High measurement density and single-cell resolution provide a powerful tool for large-scale multiparameter growth analysis based on the modeling of microcolony expansion on solid media. Pioneered in yeast but applicable to other colony formin…

0301 basic medicine; Systems biology; Saccharomyces cerevisiae; Cell; Bioengineering; Saccharomyces cerevisiae; Investigations; Biology; yeast; QH426-470; lag time; 03 medical and health sciences; Genetic Heterogeneity; Lag time; Single-cell analysis; medicine; Population Heterogeneity; carrying capacity; Genetics; Doubling time; Molecular Biology; Throughput (business); Genetics (clinical); 030304 developmental biology; Cell Proliferation; 0303 health sciences; Genome; Ecology; 030306 microbiology; Cell growth; Systems Biology; Cell Cycle; Human Genome; biology.organism_classification; Yeast; 030104 developmental biology; medicine.anatomical_structure; Phenotype; Fungal; Gene-Environment Interaction; growth rate; fitness assessment; Generic health relevance; Genome Fungal; Single-Cell Analysis; Biological system; G3: Genes, Genomes, Genetics

researchProduct

Do next-generation sequencing results drive diagnostic and therapeutic decisions in MDS?

2019

This article is available at the following URL: https://ashpublications.org/bloodadvances/article/3/21/3454/422749/Do-next-generation-sequencing-results-drive

0301 basic medicine; Síndromes mielodisplásicos - Aspectos moleculares; Clinical Decision-Making; MEDLINE; Computational biology; DNA sequencing; 03 medical and health sciences; 0302 clinical medicine; Text mining; Humans; Medicine; Genetic Predisposition to Disease; Sangre - Células - Aspectos moleculares; Molecular Targeted Therapy; Genes; Genetic Association Studies; Blood cells - Molecular aspects; business.industry; Decision Trees; Disease Management; High-Throughput Nucleotide Sequencing; Genomics; Hematology; Prognosis; Combined Modality Therapy; Myelodysplastic syndrome - Molecular aspects; 030104 developmental biology; Myelodysplastic Syndromes; 030220 oncology & carcinogenesis; Mutation; Point-Counterpoint; Molecular biology; Biología molecular; business; Biomarkers; Blood Advances

researchProduct

Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data

2016

Next-generation sequencing technologies have led to the sequencing of more and more genomes, propelling related research into the era of big data. In this paper, we present ParaBWT, a parallelized Burrows-Wheeler transform (BWT) and suffix array construction algorithm for big genome data. In ParaBWT, we have investigated a progressive approach to constructing the BWT of single genome sequences in linear space complexity, but with a small constant factor. This approach has been further parallelized using multi-threading based on a master-slave coprocessing model. After obtaining the BWT, the suffix array is constructed in a memory-efficient manner. The performance of ParaBWT has b…

0301 basic medicine; Theoretical computer science; Burrows–Wheeler transform; Computer science; Genomics; Data_CODINGANDINFORMATIONTHEORY; Parallel computing; Genome; law.invention; 03 medical and health sciences; law; Genetics; Humans; Ensembl; Multi-core processor; Applied Mathematics; Linear space; Suffix array; Chromosome Mapping; High-Throughput Nucleotide Sequencing; Genomics; Sequence Analysis DNA; 030104 developmental biology; Algorithms; Biotechnology; Reference genome; IEEE/ACM Transactions on Computational Biology and Bioinformatics

researchProduct

Accelerating metagenomic read classification on CUDA-enabled GPUs.

2016

Metagenomic sequencing studies are becoming increasingly popular, with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes, software tools for fast and accurate metagenomic read classification are urgently needed. We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (…

0301 basic medicine; Theoretical computer science; Workstation; GPUs; Computer science; Context (language use); CUDA; Parallel computing; Biochemistry; Genome; law.invention; 03 medical and health sciences; CUDA; User-Computer Interface; 0302 clinical medicine; Structural Biology; law; Taxonomic assignment; Humans; Microbiome; Molecular Biology; Internet; Xeon; Applied Mathematics; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; Exact k-mer matching; Computer Science Applications; 030104 developmental biology; Titan (supercomputer); Metagenomics; 030220 oncology & carcinogenesis; Metagenomics; DNA microarray; Software; BMC bioinformatics

researchProduct

Human Upcyte Hepatocytes: Characterization of the Hepatic Phenotype and Evaluation for Acute and Long-Term Hepatotoxicity Routine Testing

2016

The capacity of human hepatic cell-based models to predict hepatotoxicity depends on the functional performance of cells. The major limitations of human hepatocytes include the scarce availability and rapid loss of the hepatic phenotype. Hepatoma cells are readily available and easy to handle, but are metabolically poor compared with hepatocytes. Recently developed human upcyte hepatocytes offer the advantage of combining many features of primary hepatocytes with the unlimited availability of hepatoma cells. We analyzed the phenotype of upcyte hepatocytes comparatively with HepG2 cells and adult primary human hepatocytes to characterize their functional features as a differentiated hepatic …

0301 basic medicine; Time Factors; Primary Cell Culture; Transfection; Toxicology; Risk Assessment; Transcriptome; 03 medical and health sciences; 0302 clinical medicine; Metabolomics; Cytochrome P-450 Enzyme System; In vivo; Toxicity Tests; medicine; Humans; Child; Glycogen synthase; Dose-Response Relationship Drug; biology; Infant Newborn; Cytochrome P450; Hep G2 Cells; Middle Aged; medicine.disease; Phenotype; High-Throughput Screening Assays; Isoenzymes; Oxidative Stress; Phenotype; 030104 developmental biology; Gene Expression Regulation; Liver; 030220 oncology & carcinogenesis; Hepatocytes; biology.protein; Hepatic stellate cell; Cancer research; Chemical and Drug Induced Liver Injury; Steatosis; Transcriptome; Toxicological Sciences

researchProduct

Characterisation of CDKL5 Transcript Isoforms in Human and Mouse.

2016

Mutations in the X-linked Cyclin-Dependent Kinase-Like 5 gene (CDKL5) cause early onset infantile spasms and subsequent severe developmental delay in affected children. Deleterious mutations have been reported to occur throughout the CDKL5 coding region. Several studies point to a complex CDKL5 gene structure in terms of exon usage and transcript expression. Improvements in molecular diagnosis and more extensive research into the neurobiology of CDKL5 and pathophysiology of CDKL5 disorders necessitate an updated analysis of the gene. In this study, we have analysed human and mouse CDKL5 transcript patterns both bioinformatically and experimentally. We have characterised the predominant brai…

0301 basic medicine; Untranslated region; Transcription Genetic; CDKL5; lcsh:Medicine; Gene Expression; Artificial Gene Amplification and Extension; Polymerase Chain Reaction; Biochemistry; Exon; Mice; 0302 clinical medicine; Coding region; Protein Isoforms; lcsh:Science; Genetics; Regulation of gene expression; Multidisciplinary; Mammalian Genomics; High-Throughput Nucleotide Sequencing; Exons; Genomics; Nucleic acids; RNA isolation; Phenotype; Spasms Infantile; Research Article; Gene isoform; Biology; Protein Serine-Threonine Kinases; Polyadenylation; Research and Analysis Methods; Biomolecular isolation; 03 medical and health sciences; Genetics; Animals; Humans; Adults; Amino Acid Sequence; Molecular Biology Techniques; Gene; Molecular Biology; Alternative splicing; lcsh:R; Gene Mapping; Infant Newborn; Biology and Life Sciences; Reverse Transcriptase-Polymerase Chain Reaction; Alternative Splicing; 030104 developmental biology; Gene Expression Regulation; RNA processing; Age Groups; Animal Genomics; Mutation; People and Places; Exon Mapping; RNA; lcsh:Q; Population Groupings; 030217 neurology & neurosurgery; PloS one

researchProduct