Search results for " sequencing"

showing 10 items of 976 documents

AnySeq: A High Performance Sequence Alignment Library based on Partial Evaluation

2020

Sequence alignments are fundamental to bioinformatics which has resulted in a variety of optimized implementations. Unfortunately, the vast majority of them are hand-tuned and specific to certain architectures and execution models. This not only makes them challenging to understand and extend, but also difficult to port to other platforms. We present AnySeq - a novel library for computing different types of pairwise alignments of DNA sequences. Our approach combines high performance with an intuitively understandable implementation, which is achieved through the concept of partial evaluation. Using the AnyDSL compiler framework, AnySeq enables the compilation of algorithmic variants that ar…

FOS: Computer and information sciences0301 basic medicineScheme (programming language)Computer Science - PerformanceComputer science0206 medical engineeringSequence alignment02 engineering and technologyParallel computingcomputer.software_genreMetaprogrammingDNA sequencingPartial evaluationPerformance (cs.PF)03 medical and health sciences030104 developmental biologyComputer Science - Distributed Parallel and Cluster ComputingFunction composition (computer science)MultithreadingDistributed Parallel and Cluster Computing (cs.DC)Compilercomputer020602 bioinformaticscomputer.programming_languageCodebase
researchProduct

Helminth Microbiota Profiling Using Bacterial 16S rRNA Gene Amplicon Sequencing: From Sampling to Sequence Data Mining

2021

Symbiont microbial communities play important roles in animal biology and are thus considered integral components of metazoan organisms, including parasitic worms (helminths). Nevertheless, the study of helminth microbiomes has thus far been largely overlooked, and symbiotic relationships between helminths and their microbiomes have been only investigated in selected parasitic worms. Over the past decade, advances in next-generation sequencing technologies, coupled with their increased affordability, have spurred investigations of helminth-associated microbial communities aiming at enhancing current understanding of their fundamental biology and physiology, as well as of host-microbe intera…

FOS: Computer and information sciencesBioinformaticsComputational biologyBiologyDNA sequencingSymbiosisHelminthsRNA Ribosomal 16Sparasitic diseasesHelminthAnimalsData MiningHelminthsMicrobiomeGeneBacterial 16S rRNA geneIndirect life cycleHigh-throughput sequencingMicrobiotaHigh-Throughput Nucleotide SequencingGenes rRNASchistosoma mansoniAmplicon sequencingHuman genomeSample collectionWorm-associated microbiome
researchProduct

Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform

2012

Motivation The Burrows-Wheeler transform (BWT) is the foundation of many algorithms for compression and indexing of text data, but the cost of computing the BWT of very large string collections has prevented these techniques from being widely applied to the large sets of sequences often encountered as the outcome of DNA sequencing experiments. In previous work, we presented a novel algorithm that allows the BWT of human genome scale data to be computed on very moderate hardware, thus enabling us to investigate the BWT as a tool for the compression of such datasets. Results We first used simulated reads to explore the relationship between the level of compression and the error rate, the leng…

FOS: Computer and information sciencesStatistics and ProbabilityBurrows–Wheeler transformComputer scienceData_CODINGANDINFORMATIONTHEORYBurrows-Wheeler transformcomputer.software_genreBiochemistryBurrows-Wheeler transform; Data Compression; Next-generation sequencingComputer Science - Data Structures and AlgorithmsEscherichia coliCode (cryptography)HumansOverhead (computing)Data Structures and Algorithms (cs.DS)Computer SimulationQuantitative Biology - GenomicsMolecular BiologyGenomics (q-bio.GN)Genome HumanString (computer science)Search engine indexingSortingGenomicsSequence Analysis DNAConstruct (python library)Data CompressionComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsFOS: Biological sciencesNext-generation sequencingData miningDatabases Nucleic AcidcomputerAlgorithmsData compression
researchProduct

Confidence-based Somatic Mutation Evaluation and Prioritization

2012

Next generation sequencing (NGS) has enabled high throughput discovery of somatic mutations. Detection depends on experimental design, lab platforms, parameters and analysis algorithms. However, NGS-based somatic mutation detection is prone to erroneous calls, with reported validation rates near 54% and congruence between algorithms less than 50%. Here, we developed an algorithm to assign a single statistic, a false discovery rate (FDR), to each somatic mutation identified by NGS. This FDR confidence value accurately discriminates true mutations from erroneous calls. Using sequencing data generated from triplicate exome profiling of C57BL/6 mice and B16-F10 melanoma cells, we used the exist…

False discovery rateSequence analysisSomatic cellQH301-705.5Low ConfidenceDNA Mutational AnalysisBiologySensitivity and SpecificityDNA sequencing03 medical and health sciencesCellular and Molecular NeuroscienceMice0302 clinical medicineGermline mutationGenetic MutationGeneticsAnimalsExomeFalse Positive ReactionsGenome SequencingBiology (General)Molecular BiologyExomeBiologyMelanomaEcology Evolution Behavior and SystematicsHealth aging / healthy living Cardiovascular diseases [IGMD 5]030304 developmental biologyGenetics0303 health sciencesEcologyReceiver operating characteristicComputational BiologyReproducibility of ResultsGenomicsDNA NeoplasmSequence Analysis DNAMice Inbred C57BLComputational Theory and Mathematics030220 oncology & carcinogenesisModeling and SimulationMutationArtifactsResearch Article
researchProduct

RNA sequencing-based transcriptome profiling of cardiac tissue Implicados novela putative disease mechanisms in FLNC-associated arrhythmogenic cardio…

2020

Arrhythmogenic cardiomyopathy (ACM) encompasses a group of inherited cardiomyopathies including arrhythmogenic right ventricular cardiomyopathy (ARVC) whose molecular disease mechanism is associated with dysregulation of the canonical WNT signalling pathway. Recent evidence indicates that ARVC and ACM caused by pathogenic variants in the FLNC gene encoding filamin C, a major cardiac structural protein, may have different molecular mechanisms of pathogenesis. We sought to identify dysregulated biological pathways in FLNC-associated ACM. RNA was extracted from seven paraffin-embedded left ventricular tissue samples from deceased ACM patients carrying FLNC variants and sequenced. Transcript le…

FilaminsDNA Mutational Analysis030204 cardiovascular system & hematologyGene mutationFilaminArticleTranscriptome03 medical and health sciences0302 clinical medicineHumansMedicineGenetic Predisposition to Disease030212 general & internal medicineJAM2FLNCGeneArrhythmogenic Right Ventricular Dysplasiabusiness.industryGene Expression ProfilingDNAArrhythmogenic cardiomyopathy Filamin C Focal adhesion pathway Integrin linked kinase pathway RNA sequencingActin cytoskeletonPatologiaCell biologyPhenotypeMutationCardiology and Cardiovascular MedicinebusinessMYL7
researchProduct

AnABlast: Re-searching for Protein-Coding Sequences in Genomic Regions

2019

AnABlast is a computational tool that highlights protein-coding regions within intergenic and intronic DNA sequences which escape detection by standard gene prediction algorithms. DNA sequences with small protein-coding genes or exons, complex intron-containing genes, or degenerated DNA fragments are efficiently targeted by AnABlast. Furthermore, this algorithm is particularly useful in detecting protein-coding sequences with nonsignificant homologs to sequences in databases. AnABlast can be executed online at http://www.bioinfocabd.upo.es/anablast/ .

Fossil DNA sequencesProtein coding0303 health sciencesGene predictionCoding DNA sequences030302 biochemistry & molecular biologyComputational biologyBiologyGene findingDNA sequencing03 medical and health sciencesExonchemistry.chemical_compoundIntergenic regionchemistryHomologous chromosomeSmall genesGeneIn silico annotation toolDNA030304 developmental biology
researchProduct

Gene structure and hemocyanin isoform HtH2 from the mollusc Haliotis tuberculata indicate early and late intron hot spots.

2002

Abstract We have cloned and sequenced cDNAs coding for the complete primary structure of HtH2, the second hemocyanin isoform of the marine gastropod Haliotis tuberculata. The deduced protein sequence comprises 3399 amino acids, corresponding to a molecular mass of 392 kDa. It shares only 66% of structural identity with the previously analysed first isoform HtH1, and according to a molecular clock, the two isoforms of Haliotis hemocyanin separated ca. 320 million years ago. By genomic polymerase chain reaction and 5′ race, we have also sequenced the complete gene of HtH2 (18,598 bp), except of the 5′ region in front of the secreted protein. It encompasses 15 exons and 14 introns and shows se…

Gene isoformDNA ComplementaryTime Factorsmedicine.medical_treatmentProtein subunitMolecular Sequence DataBiologyEvolution MolecularExonProtein sequencingGeneticsmedicineAnimalsProtein IsoformsAmino Acid SequenceGeneGeneticsBase SequenceSequence Homology Amino AcidProtein primary structureIntronHemocyaninGeneral MedicineDNAExonsSequence Analysis DNAIntronsGenesMolluscaHemocyaninsSequence AlignmentGene
researchProduct

The metal binding abilities of Megathura crenulata metallothionein (McMT) in the frame of gastropoda MTs.

2011

Metallothioneins (MTs) are proteins that play a major role in metal homeostasis and/or detoxification in all kind of organisms. The MT gene/protein system of gastropod molluscs provides an invaluable model to study the diversification mechanisms that have enabled MTs to achieve metal-binding specificity through evolution. Most pulmonate gastropods, particularly terrestrial snails, harbor three paralogous isogenes encoding three MT isoforms with different metal binding preferences: the highly specific CdMT and CuMT isoforms, for cadmium and copper respectively, and the unspecific Cd/CuMT isoform. Megathura crenulata is a non-pulmonate gastropod in which only one MT isogene has so far been re…

Gene isoformSpectrometry Mass Electrospray Ionizationanimal structuresGastropodaPeptidePlasma protein bindingMegathura crenulataBiochemistryInorganic ChemistryProtein sequencingGastropodaMetallothioneinAnimalsGenechemistry.chemical_classificationbiologyChemistryEcologybiology.organism_classificationZincBiochemistryMetalsMetallothioneinCopperCadmiumProtein BindingJournal of inorganic biochemistry
researchProduct

Disparity between Inter-Patient Molecular Heterogeneity and Repertoires of Target Drugs Used for Different Types of Cancer in Clinical Oncology

2020

Inter-patient molecular heterogeneity is the major declared driver of an expanding variety of anticancer drugs and personalizing their prescriptions. Here, we compared interpatient molecular heterogeneities of tumors and repertoires of drugs or their molecular targets currently in use in clinical oncology. We estimated molecular heterogeneity using genomic (whole exome sequencing) and transcriptomic (RNA sequencing) data for 4890 tumors taken from The Cancer Genome Atlas database. For thirteen major cancer types, we compared heterogeneities at the levels of mutations and gene expression with the repertoires of targeted therapeutics and their molecular targets accepted by the current guideli…

Gene mutationMedical OncologychemotherapyGenomeTranscriptomelcsh:ChemistryDrug Delivery SystemsProstateNeoplasmstumor heterogeneityMedicineCluster AnalysisMolecular Targeted TherapyPathology MolecularPrecision Medicinelcsh:QH301-705.5targeted therapeuticscancer drugsSpectroscopyExome sequencingGeneral MedicineGenomicspersonalized medicineComputer Science ApplicationsDrug repositioningmedicine.anatomical_structureAntineoplastic AgentsComputational biologyCatalysisArticleInorganic Chemistrymolecular diagnosticsGenetic HeterogeneityDrug TherapyExome SequencingHumansPhysical and Theoretical ChemistryMolecular Biologygenomeclinical oncologybusiness.industryOrganic ChemistryMolecular diagnosticsmutationslcsh:Biology (General)lcsh:QD1-999MutationPersonalized medicinebusinesstranscriptomeInternational Journal of Molecular Sciences
researchProduct

Next-Generation Sequencing: Application in Liver Cancer—Past, Present and Future?

2012

Hepatocellular Carcinoma (HCC) is the third most deadly malignancy worldwide characterized by phenotypic and molecular heterogeneity. In the past two decades, advances in genomic analyses have formed a comprehensive understanding of different underlying pathobiological layers resulting in hepatocarcinogenesis. More recently, improvements of sophisticated next-generation sequencing (NGS) technologies have enabled complete and cost-efficient analyses of cancer genomes at a single nucleotide resolution and advanced into valuable tools in translational medicine. Although the use of NGS in human liver cancer is still in its infancy, great promise rests in the systematic integration of different …

General Immunology and MicrobiologyNext-generation sequencing (NGS)business.industryTranslational medicineCancerGenomicsReviewpersonalized medicineBiologyBioinformaticsmedicine.diseaseMalignancyGeneral Biochemistry Genetics and Molecular BiologyDNA sequencingintegrative genomicslcsh:Biology (General)medicinePersonalized medicineHepatocellular carcinoma (HCC)General Agricultural and Biological SciencesLiver cancerbusinesslcsh:QH301-705.5EpigenomicsBiology
researchProduct