Search results for "Sequence assembly"

showing 6 items of 26 documents

De Novo Assembly and Functional Annotation of the Olive (Olea europaea) Transcriptome

2013

Olive breeding programmes are focused on selecting for traits as short juvenile period, plant architecture suited for mechanical harvest, or oil characteristics, including fatty acid composition, phenolic, and volatile compounds to suit new markets. Understanding the molecular basis of these characteristics and improving the efficiency of such breeding programmes require the development of genomic information and tools. However, despite its economic relevance, genomic information on olive or closely related species is still scarce. We have applied Sanger and 454 pyrosequencing technologies to generate close to 2 million reads from 12 cDNA libraries obtained from the Picual, Arbequina, and L…

Sanger sequencingSequence assemblyBreedingOleaDatabases GeneticBotanygenomicsGeneticsPlant OilsJuvenileCultivarOlea europaeaOlive OilMolecular BiologyGene LibraryExpressed Sequence TagsExpressed sequence tagbiologycDNA libraryMolecular Sequence AnnotationSequence Analysis DNAGeneral MedicineFull Papersbiology.organism_classificationMolecular Sequence AnnotationOleaFruitSeedsPyrosequencing454 pyrosequencingTranscriptomeGenome PlantDNA Research

researchProduct

Direct squencing from the minimal number of DNA molecules needed to fill a 454 picotiterplate

2014

Notice of Republication: This article was republished on June 17, 2014, to correct an error in the title. The publisher apologizes for the error. In addition, a typographical error was corrected in the Abstract. Please download this article again to view the correct version. The originally published, uncorrected article and the republished, corrected article are provided here for reference.

ScienceSequence assemblyHybrid genome assemblyBiologyDNA sequencingDeep sequencingGens humans MapatgeSequencing by hybridizationMapatgeEscherichia coliGeneticsCluster AnalysisGenome SequencingMolecular Biology TechniquesSequencing TechniquesMolecular BiologyGene LibraryGeneticsWhole Genome AmplificationMultidisciplinaryGenètica bacterianaShotgun sequencingQRMultiple displacement amplificationChromosome MappingHigh-Throughput Nucleotide SequencingBiology and Life SciencesComputational BiologySequence Analysis DNAGenomicsGenome AnalysisGens humansMedicineSequence AnalysisGenome BacterialResearch Article

researchProduct

Musket: a multistage k-mer spectrum-based error corrector for Illumina sequence data

2012

Abstract Motivation: The imperfect sequence data produced by next-generation sequencing technologies have motivated the development of a number of short-read error correctors in recent years. The majority of methods focus on the correction of substitution errors, which are the dominant error source in data produced by Illumina sequencing technology. Existing tools either score high in terms of recall or precision but not consistently high in terms of both measures. Results: In this article, we present Musket, an efficient multistage k-mer-based corrector for Illumina short-read data. We use the k-mer spectrum approach and introduce three correction techniques in a multistage workflow: two-s…

Statistics and ProbabilityComputer sciencebusiness.industrySequence assemblySequence Analysis DNAMusketBiochemistryComputer Science ApplicationsComputational MathematicsCUDASoftwareComputational Theory and Mathematicsk-merEscherichia coliChromosomes HumanHumansbusinessFocus (optics)Molecular BiologyAlgorithmAlgorithmsGenome BacterialSoftwareIllumina dye sequencingBioinformatics

researchProduct

CARE: context-aware sequencing read error correction.

2020

Abstract Motivation Error correction is a fundamental pre-processing step in many Next-Generation Sequencing (NGS) pipelines, in particular for de novo genome assembly. However, existing error correction methods either suffer from high false-positive rates since they break reads into independent k-mers or do not scale efficiently to large amounts of sequencing reads and complex genomes. Results We present CARE—an alignment-based scalable error correction algorithm for Illumina data using the concept of minhashing. Minhashing allows for efficient similarity search within large sequencing read collections which enables fast computation of high-quality multiple alignments. Sequencing errors ar…

Statistics and ProbabilityMultiple sequence alignmentComputer scienceSequence assemblyHigh-Throughput Nucleotide SequencingContext (language use)Sequence Analysis DNAcomputer.software_genreBiochemistryGenomeComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsHumansHuman genomeData miningError detection and correctionMolecular BiologycomputerSequence AlignmentAlgorithmsSoftwareBioinformatics (Oxford, England)

researchProduct

Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations

2015

Metagenomic libraries consist of DNA fragments from diverse species, with varying genome size and abundance. High-throughput sequencing platforms produce large volumes of reads from these libraries, which may be assembled into contigs, ideally resembling the original larger genomic sequences. The uneven species distribution, along with the stochasticity in sample processing and sequencing bias, impacts the success of accurate sequence assembly. Several assemblers enable the processing of viral metagenomic data de novo, generally using overlap layout consensus or de Bruijn graph approaches for contig assembly. The success of viral genomic reconstruction in these datasets is limited by the de…

Viral metagenomicsHistologyOTUlcsh:BiotechnologySpecies distributionBiomedical EngineeringSequence assemblyBioengineeringComputational biologyBiologyassemblercomputer.software_genreGenomediversitylcsh:TP248.13-248.65Alpha diversityGenome sizeOriginal ResearchContigBioengineering and BiotechnologyMetagenomicsAlpha diversityBioinformacticsData miningviral metagenomicscomputerBiotechnologyFrontiers in Bioengineering and Biotechnology

researchProduct

De novogenome assembly of the land snailCandidula unifasciata(Mollusca: Gastropoda)

2021

AbstractAmong all molluscs, land snails are an economically and scientifically interesting group comprising edible species, alien species and agricultural pests. Yet, despite its high diversity, the number of whole genomes publicly available is still scarce. Here, we present the draft genome assembly of the land snailCandidula unifasciata, a widely distributed species along central Europe, which belongs to Geomitridae family, a group highly diversified in the Western-Palearctic region. We performed a whole genome sequencing, assembly and annotation of an adult specimen based on PacBio and Oxford Nanopore long read sequences as well as Illumina data. A genome of about 1.29 Gb was generated w…

Whole genome sequencingCandidula unifasciataeducation.field_of_studybiologyPopulationLand snailSequence assemblySnailbiology.organism_classificationGenomeEvolutionary biologybiology.animaleducationGene

researchProduct