Search results for "Sequence assembly"

showing 10 items of 26 documents

De novo transcriptome assembly and developmental mode specific gene expression of Pygospio elegans

2017

Species with multiple different larval developmental modes are interesting models for the study of mechanisms underlying developmental mode transitions and life history evolution. Pygospio elegans, a small, tube-dwelling polychaete worm commonly found in estuarine and marine habitats around the northern hemisphere, is one species with variable developmental modes. To provide new genomic resources for studying P. elegans and to address the differences in gene expression between individuals producing offspring with different larval developmental modes, we performed whole transcriptome Illumina RNA sequencing of adult worms from two populations and prepared a de novo assembly of the P. elegans…

0106 biological sciences, 0301 basic medicine, De novo transcriptome assembly, Sequence assembly, Biology, 010603 evolutionary biology, 01 natural sciences, Transcriptome, 03 medical and health sciences, Gene expression, Animals, geeniekspressio, Genes Developmental, 14. Life underwater, Ecology Evolution Behavior and Systematics, Genetics, Larva, Polychaete, Gene Expression Profiling, fungi, Gene Expression Regulation Developmental, RNA, Molecular Sequence Annotation, Polychaeta, Marine invertebrates, biology.organism_classification, 030104 developmental biology, Larva, gene expression, ta1181, Transcriptome, Microsatellite Repeats, Developmental Biology, Evolution & Development
researchProduct

De novo genome assembly of the land snail Candidula unifasciata (Mollusca: Gastropoda)

2021

Abstract Among all molluscs, land snails are a scientifically and economically interesting group comprising edible species, alien species and agricultural pests. Yet, despite their high diversity, the number of genome drafts publicly available is still scarce. Here, we present the draft genome assembly of the land snail Candidula unifasciata, a widely distributed species along central Europe, belonging to the Geomitridae family, a highly diversified taxon in the Western-Palearctic region. We performed whole genome sequencing, assembly and annotation of an adult specimen based on PacBio and Oxford Nanopore long read sequences as well as Illumina data. A genome draft of about 1.29 Gb was gene…

0106 biological sciences, Candidula unifasciata, AcademicSubjects/SCI01140, AcademicSubjects/SCI00010, repeats, Population, Snails, Sequence assembly, Snail, QH426-470, de novo assembly, AcademicSubjects/SCI01180, 010603 evolutionary biology, 01 natural sciences, Genome, 03 medical and health sciences, biology.animal, land snails, long reads, Genetics, Animals, education, Molecular Biology, Gene, Genetics (clinical), 030304 developmental biology, Whole genome sequencing, Geomitridae, molluscs, 0303 health sciences, education.field_of_study, Genome, biology, Land snail, Molecular Sequence Annotation, Genomics, Sequence Analysis DNA, biology.organism_classification, Genome Report, annotation, Evolutionary biology, AcademicSubjects/SCI00960, G3: Genes|Genomes|Genetics
researchProduct

High-Quality Genome Assembly and Annotation of the Big-Eye Mandarin Fish (Siniperca knerii)

2020

Abstract The big-eye mandarin fish (Siniperca knerii) is an endemic species of southern China. It belongs to the family Sinipercidae, which is closely related to the well-known North American sunfish family Centrarchidae. Determining the genome sequence of S. knerii would provide a foundation for better examining its genetic diversity and population history. A novel sequenced genome of the Sinipercidae also would help in comparative study of the Centrarchidae using Siniperca as a reference. Here, we determined the genome sequence of S. knerii using 10x Genomics technology and next-generation sequencing. Paired-end sequencing on a half lane of HiSeq X platform generated 56 Gbp of raw data. R…

0106 biological sciences, Gene prediction, Population, Chinese perch, Sequence assembly, Genomics, Siniperca, QH426-470, Biology, Genome sequencing, 010603 evolutionary biology, 01 natural sciences, Genome, 03 medical and health sciences, Genome Size, Genetics, Animals, Siniperca knerii, education, Molecular Biology, Genome size, Genetics (clinical), 030304 developmental biology, Whole genome sequencing, 0303 health sciences, education.field_of_study, Genome assembly, Genome, 10x Genomics, Fishes, High-Throughput Nucleotide Sequencing, Molecular Sequence Annotation, Genomics, biology.organism_classification, Genome Report, Evolutionary biology, G3: Genes|Genomes|Genetics
researchProduct

Next-generation biological control

2020

Biological control is widely successful at controlling pests, but effective biocontrol agents are now more difficult to import from countries of origin due to more restrictive international trade laws (the Nagoya Protocol). Coupled with increasing demand, the efficacy of existing and new biocontrol agents needs to be improved with genetic and genomic approaches. Although they have been underutilised in the past, application of genetic and genomic techniques is becoming more feasible from both technological and economic perspectives. We review current methods and provide a framework for using them. First, it is necessary to identify which biocontrol trait to select and in what direction. Nex…

0106 biological sciences, Proteomics, H10 Pests of plants, Internationality, Computer science, [SDV]Life Sciences [q-bio], Laboratory of Virology, Sequence assembly, biological control, microbiome, 01 natural sciences, Genome editing, genetics, Nagoya Protocol, Laboratory of Entomology, CYTOPLASMIC INCOMPATIBILITY, 2. Zero hunger, 0303 health sciences, QUANTITATIVE TRAIT LOCI, Commerce, food and beverages, CONTROL AGENTS, PE&RC, Biosystematiek, NASONIA-VITRIPENNIS, GUT CONTENT-ANALYSIS, [SDE]Environmental Sciences, Trait, insect breeding, AXYRIDIS COLEOPTERA-COCCINELLIDAE, Original Article, Laboratory of Genetics, LIFE-HISTORY TRAITS, General Agricultural and Biological Sciences, Genomics, Context (language use), Computational biology, [SDV.BID]Life Sciences [q-bio]/Biodiversity, artificial selection, Quantitative trait locus, Animal Breeding and Genomics, Laboratorium voor Erfelijkheidsleer, 010603 evolutionary biology, General Biochemistry Genetics and Molecular Biology, Laboratorium voor Virologie, modelling, 03 medical and health sciences, genomics, [SDV.BV]Life Sciences [q-bio]/Vegetal Biology, Fokkerij en Genomica, PARASITOID WASP, Selection (genetic algorithm), modelling., 030304 developmental biology, SEX DETERMINATION, Original Articles, Laboratorium voor Entomologie, WIAS, genome assembly, Biosystematics, EPS, artificial selection biological control genetics genome assembly genomics insect breeding microbiome modelling, Biological Reviews
researchProduct

A haplotype-resolved, de novo genome assembly for the wood tiger moth (Arctia plantaginis) through trio binning

2020

ABSTRACT Background: Diploid genome assembly is typically impeded by heterozygosity because it introduces errors when haplotypes are collapsed into a consensus sequence. Trio binning offers an innovative solution that exploits heterozygosity for assembly. Short, parental reads are used to assign parental origin to long reads from their F1 offspring before assembly, enabling complete haplotype resolution. Trio binning could therefore provide an effective strategy for assembling highly heterozygous genomes, which are traditionally problematic, such as insect genomes. This includes the wood tiger moth (Arctia plantaginis), which is an evolutionary study system for warning colour polymorphism. F…

0106 biological sciences, haplotype, population genomics, AcademicSubjects/SCI02254, Population, Sequence assembly, Health Informatics, wood tiger moth; Arctia plantaginis, Moths, Biology, Data Note, genotyyppi, 010603 evolutionary biology, 01 natural sciences, Genome, täpläsiilikäs, Population genomics, Loss of heterozygosity, 03 medical and health sciences, Consensus sequence, Animals, Humans, education, 030304 developmental biology, 0303 health sciences, education.field_of_study, Genetic diversity, Genome, trio binning, Haplotype, wood tiger moth, Karyotype, genomiikka, Genomics, Wood, Computer Science Applications, Lepidoptera, Haplotypes, annotation, populaatiogenetiikka, Evolutionary biology, perimä, genome assembly, AcademicSubjects/SCI00960, Corrigendum
researchProduct

A high-quality genome assembly from short and long reads for the non-biting midge Chironomus riparius (Diptera)

2020

Abstract Background: Chironomus riparius is of great importance as a study species in various fields like ecotoxicology, molecular genetics, developmental biology and ecology. However, only a fragmented draft genome exists to date, hindering the recent rush of population genomic studies in this species. Findings: Making use of 50 NGS datasets, we present a hybrid genome assembly from short and long sequence reads that make C. riparius’ genome one of the most contiguous Dipteran genomes published, the first complete mitochondrial genome of the species and the respective recombination rate as one of the first insect recombination rates at all. Conclusions: The genome and associated resources will be h…

0106 biological sciences, medicine.medical_specialty, Mitochondrial DNA, Ecology (disciplines), ved/biology.organism_classification_rank.species, Population, Sequence assembly, Hybrid genome assembly, QH426-470, Biology, 010603 evolutionary biology, 01 natural sciences, Genome, Chironomidae, 03 medical and health sciences, Molecular genetics, chironomus riparius, Genetics, medicine, Animals, education, Molecular Biology, Genetics (clinical), 030304 developmental biology, recombination rate, Chironomus riparius, 0303 health sciences, education.field_of_study, Genome, hybrid genome assembly, ved/biology, Genome Report, Evolutionary biology
researchProduct

Transcriptome analysis and codominant markers development in caper, a drought tolerant orphan crop with medicinal value.

2019

Abstract Caper (Capparis spinosa L.) is a xerophytic shrub cultivated for its flower buds and fruits, used as food and for their medicinal properties. Breeding programs and even proper taxonomic classification of the genus Capparis have been hampered so far by the lack of reliable genetic information and molecular markers. Here, we present the first genomic resource for C. spinosa, generated by a transcriptomic approach and de novo assembly. The sequencing effort produced nearly 80 million clean reads assembled into 124,723 unitranscripts. Careful annotation and comparison with public databases revealed homologs to genes with a key role in important metabolic pathways linked to abiotic stress t…

0301 basic medicine, Capparis, Agricultural genetics, abiotic stress, SAPs, Plant genetics, Science, Drought tolerance, Sequence assembly, Computational biology, Biology, Article, Transcriptome, 03 medical and health sciences, 0302 clinical medicine, food, Stress Physiological, EST-SSR, Gene, orphan crop, Plant Proteins, de novo leaf transcriptome, Multidisciplinary, Plants Medicinal, Phenylpropanoid, Abiotic stress, Settore BIO/02 - Botanica Sistematica, Capparis spinosa, Gene Expression Profiling, Caper Capparis spinosa Codominant markers Transcriptome analysis Orphan crop, Q, R, food and beverages, biology.organism_classification, food.food, Capparis spinosa L., Droughts, Capparis, 030104 developmental biology, NGS, Medicine, Transcriptome, 030217 neurology & neurosurgery, Biomarkers, Metabolic Networks and Pathways, Scientific reports
researchProduct

Informational and linguistic analysis of large genomic sequence collections via efficient Hadoop cluster algorithms

2018

Abstract Motivation: Information theoretic and compositional/linguistic analysis of genomes have a central role in bioinformatics, even more so since the associated methodologies are becoming very valuable also for epigenomic and meta-genomic studies. The kernel of those methods is based on the collection of k-mer statistics, i.e. how many times each k-mer in {A,C,G,T}^k occurs in a DNA sequence. Although this problem is computationally very simple and efficiently solvable on a conventional computer, the sheer amount of data now available in applications demands a resort to parallel and distributed computing. Indeed, those types of algorithms have been developed to collect k-mer statistics in…
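
The k-mer statistics mentioned in this abstract can be illustrated with a minimal single-machine sketch. It is not the Hadoop-based method of the paper: the function name `count_kmers` and the decision to skip windows containing characters outside A, C, G, T are assumptions made only for illustration.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count how many times each k-mer over {A,C,G,T} occurs in a DNA sequence.

    Hypothetical helper for illustration: windows containing any other
    character (for example N) are skipped rather than counted.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    seq = sequence.upper()
    alphabet = set("ACGT")
    counts = Counter()
    # Slide a window of length k over the sequence, one base at a time.
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= alphabet:
            counts[kmer] += 1
    return counts


if __name__ == "__main__":
    # Print each 2-mer of a toy sequence with its count.
    for kmer, n in sorted(count_kmers("ACGTACGT", 2).items()):
        print(kmer, n)
```

A sequence of length n yields at most n - k + 1 windows, so the sum of all counts is bounded by that number. A distributed version would typically compute such counts per chunk and then merge them.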

0301 basic medicine, Epigenomics, genomic analysis; hadoop; distributed computing, Statistics and Probability, Computer science, Big data, Sequence assembly, Genome, Biochemistry, Domain (software engineering), Set (abstract data type), 03 medical and health sciences, distributed computing, Software, Computational Theory and Mathematic, Animals, Cluster Analysis, Humans, A-DNA, k-mer counting distributed computing hadoop map reduce, Molecular Biology, Epigenomics, Bacteria, business.industry, k-mer counting, Eukaryota, Linguistics, Computer Science Applications1707 Computer Vision and Pattern Recognition, Genomics, Sequence Analysis DNA, Computer Science Applications, Computational Mathematics, 030104 developmental biology, map reduce, Computational Theory and Mathematics, Distributed algorithm, genomic analysis, Kernel (statistics), Metagenome, hadoop, business, Algorithm, Algorithms, Software
researchProduct

An improved genome assembly uncovers prolific tandem repeats in Atlantic cod

2016

Abstract Background: The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. Results: By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have …

0301 basic medicine, Heterozygote, Assembly algorithms, Sequence assembly, Genomics, Repetitive DNA, Biology, Genome, 03 medical and health sciences, 0302 clinical medicine, Assembly consolidation, Tandem repeat, Indel polymorphism, Genetics, Animals, Gadus, Long-read sequencing technology, Promoter Regions Genetic, Microsatellites, Repeated sequence, Gene, PacBio, Genetics, Heterozygosity, Dinucleotide repeats, Molecular Sequence Annotation, Genomics, Sequence Analysis DNA, biology.organism_classification, 030104 developmental biology, Gadus morhua, Tandem Repeat Sequences, Evolutionary biology, Pyrosequencing, Atlantic cod, 030217 neurology & neurosurgery, Research Article, Biotechnology
researchProduct

Mycobacterium tuberculosis complex lineage 5 exhibits high levels of within-lineage genomic diversity and differing gene content compared to the type…

2021

Pathogens of the Mycobacterium tuberculosis complex (MTBC) are considered to be monomorphic, with little gene content variation between strains. Nevertheless, several genotypic and phenotypic factors separate strains of the different MTBC lineages (L), especially L5 and L6 (traditionally termed Mycobacterium africanum) strains, from each other. However, this genome variability and gene content, especially of L5 strains, has not been fully explored and may be important for pathobiology and current approaches for genomic analysis of MTBC strains, including transmission studies. By comparing the genomes of 355 L5 clinical strains (including 3 complete genomes and 352 Illumina whole-genome sequenc…

0301 basic medicine, Lineage (genetic), Genotype, 030106 microbiology, Sequence assembly, Pathogens and Epidemiology, lineage 5, Genome, genomic diversity, 03 medical and health sciences, Species Specificity, Drug Resistance Multiple Bacterial, Genotype, Humans, Tuberculosis, H37Rv, Biology, Gene, Research Articles, reference genome, within-lineage variability, Genetics, Whole Genome Sequencing, biology, Chromosome Mapping, Genetic Variation, High-Throughput Nucleotide Sequencing, Mycobacterium tuberculosis, Sequence Analysis DNA, gene presence/absence, General Medicine, biology.organism_classification, 030104 developmental biology, L5.3.2, Mycobacterium tuberculosis complex, M. africanum, Human medicine, Mycobacterium africanum, Genome Bacterial, Reference genome, Microbial Genomics
researchProduct