Search results for "Data"


Predictive shelf life model based on RF technology for improving the management of food supply chain: A case study

2016

The aim of this paper was the development of a Smart Logistic Unit (SLU) based on RF technology to support the management of the food supply chain, in order to guarantee the shelf life of products consistently with logistic efficiency and system sustainability. For this purpose, the main parameters that influence the quality of perishable products were determined and a shelf life equation based on Volatile Organic Compounds (VOCs) was modelled. The levels of VOCs were gathered by the sensors placed inside the SLU, which serves as the remote element of a system for identification and data transmission. The proposed model was then validated through an experimental test, simulating the …

0301 basic medicine; Engineering; media_common.quotation_subject; 030106 microbiology; RF technology; Shelf life; Industrial and Manufacturing Engineering; Management Information Systems; 03 medical and health sciences; Rf technology; Management of Technology and Innovation; Food supply; Settore ING-IND/17 - Impianti Industriali Meccanici; 0502 economics and business; Quality (business); Electrical and Electronic Engineering; media_common; business.industry; 05 social sciences; Reliability engineering; Identification (information); Sustainability; Systems engineering; business; 050203 business & management; Settore AGR/16 - Microbiologia Agraria; Data transmission; International Journal of RF Technologies
researchProduct

Integration of animal health and public health surveillance sources to exhaustively inform the risk of zoonosis: An application to echinococcosis in …

2020

The analysis of zoonotic disease risk requires the consideration of both human and animal geo-referenced disease incidence data. Here we show an application of joint Bayesian analyses to the study of echinococcosis granulosus (EG) in the province of Rio Negro, Argentina. We focus on merging passive and active surveillance data sources of animal and human EG cases using joint Bayesian spatial and spatio-temporal models. While similar spatial clustering and temporal trends were apparent, there appears to be limited lagged dependence between animal and human outcomes. Beyond the data quality issues relating to missingness at different times, we were able to identify relations between dog and …

0301 basic medicine; Epidemiology; RC955-962; Animal Diseases; Bayes' theorem; Medical Conditions; 0302 clinical medicine; Public health surveillance; Zoonoses; Arctic medicine. Tropical medicine; Epidemiology; Medicine and Health Sciences; Public Health Surveillance; Dog Diseases; Child; Echinococcus granulosus; Mammals; Ciencias Médicas y de la Salud; Disease surveillance; Surveillance; biology; Zoonosis; Eukaryota; Echinococcosis; Infectious Diseases; Geography; Helminth Infections; Vertebrates; Public aspects of medicine; RA1-1270; Research Article; Neglected Tropical Diseases; medicine.medical_specialty; Infectious Disease Control; Adolescent; 030231 tropical medicine; Argentina; Disease Surveillance; Models Biological; 03 medical and health sciences; Dogs; Echinococcosis; Environmental health; Control; Parasitic Diseases; medicine; Animals; Humans; Echinococcus granulosus; Organisms; Public Health Environmental and Occupational Health; Biology and Life Sciences; Bayes Theorem; Tropical Diseases; medicine.disease; biology.organism_classification; 030104 developmental biology; Echinococosis; Medical Risk Factors; Infectious Disease Surveillance; Data quality; Amniotes; Zoology
researchProduct

Single-cell trajectories reconstruction, exploration and mapping of omics data with STREAM

2019

Single-cell transcriptomic assays have enabled the de novo reconstruction of lineage differentiation trajectories, along with the characterization of cellular heterogeneity and state transitions. Several methods have been developed for reconstructing developmental trajectories from single-cell transcriptomic data, but efforts on analyzing single-cell epigenomic data and on trajectory visualization remain limited. Here we present STREAM, an interactive pipeline capable of disentangling and visualizing complex branching trajectories from both single-cell transcriptomic and epigenomic data. We have tested STREAM on several synthetic and real datasets generated with different single-cell techno…

0301 basic medicine; Epigenomics; Multifactor Dimensionality Reduction; Computer science; General Physics and Astronomy; 02 engineering and technology; Omics data; Myoblasts; Mice; Single-cell analysis; GATA1 Transcription Factor; Myeloid Cells; Lymphocytes; lcsh:Science; Data processing; Multidisciplinary; Q; Gene Expression Regulation Developmental; RNA sequencing; Cell Differentiation; Genomics; 021001 nanoscience & nanotechnology; Data processing; DNA-Binding Proteins; Interferon Regulatory Factors; Single-Cell Analysis; 0210 nano-technology; Algorithms; Omics technologies; Signal Transduction; Lineage differentiation; Science; Computational biology; General Biochemistry Genetics and Molecular Biology; Article; 03 medical and health sciences; Erythroid Cells; Animals; Cell Lineage; General Chemistry; developmental trajectories visualization; Hematopoietic Stem Cells; Pipeline (software); Visualization; 030104 developmental biology; TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES; Cellular heterogeneity; Single cell analysi; lcsh:Q; Gene expression; Transcriptome; Transcription Factors; Nature Communications
researchProduct

Informational and linguistic analysis of large genomic sequence collections via efficient Hadoop cluster algorithms

2018

Motivation: Information theoretic and compositional/linguistic analysis of genomes have a central role in bioinformatics, even more so since the associated methodologies are also becoming very valuable for epigenomic and meta-genomic studies. The kernel of those methods is based on the collection of k-mer statistics, i.e. how many times each k-mer in {A,C,G,T}^k occurs in a DNA sequence. Although this problem is computationally very simple and efficiently solvable on a conventional computer, the sheer amount of data now available in applications demands resorting to parallel and distributed computing. Indeed, algorithms of that type have been developed to collect k-mer statistics in…

0301 basic medicine; Epigenomics; genomic analysis; hadoop; distributed computing; Statistics and Probability; Computer science; Big data; Sequence assembly; Genome; Biochemistry; Domain (software engineering); Set (abstract data type); 03 medical and health sciences; distributed computing; Software; Computational Theory and Mathematic; Animals; Cluster Analysis; Humans; A-DNA; k-mer counting distributed computing hadoop map reduce; Molecular Biology; Epigenomics; Bacteria; business.industry; k-mer counting; Eukaryota; Linguistics; Computer Science Applications1707 Computer Vision and Pattern Recognition; Genomics; Sequence Analysis DNA; Computer Science Applications; Computational Mathematics; 030104 developmental biology; map reduce; Computational Theory and Mathematics; Distributed algorithm; genomic analysis; Kernel (statistics); Metagenome; hadoop; business; Algorithm; Algorithms; Software
researchProduct
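
The k-mer statistics named in the abstract of the preceding entry (how many times each k-mer over {A,C,G,T} occurs in a DNA sequence) can be illustrated with a minimal single-machine sketch. This is not the paper's Hadoop algorithm; the function name `count_kmers` and the choice to skip windows containing other symbols (such as N) are assumptions made for illustration only.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count occurrences of each length-k substring over {A,C,G,T}.

    Hypothetical helper for illustration: windows that contain any
    other symbol (for example N) are skipped rather than counted.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    seq = sequence.upper()
    alphabet = set("ACGT")
    counts: Counter = Counter()
    # Slide a window of width k across the sequence, one position at a time.
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= alphabet:
            counts[kmer] += 1
    return counts


if __name__ == "__main__":
    for kmer, n in sorted(count_kmers("ACGTACGT", 2).items()):
        print(kmer, n)
```

A sequence of length n yields at most n - k + 1 windows, so a single pass is cheap; as the abstract notes, it is the volume of input data, not the difficulty of the computation, that motivates distributing the work.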

Diversification of spatiotemporal expression and copy number variation of the echinoid hbox12/pmar1/micro1 multigene family

2017

Changes occurring during evolution in the cis-regulatory landscapes of individual members of multigene families might impart diversification in their spatiotemporal expression and function. The archetypal member of the echinoid hbox12/pmar1/micro1 family is hbox12-a, a homeobox-containing gene expressed exclusively by dorsal blastomeres, where it governs the dorsal/ventral gene regulatory network during embryogenesis of the sea urchin Paracentrotus lividus. Here we describe the inventory of the hbox12/pmar1/micro1 genes in P. lividus, highlighting that gene copy number variation occurs across individual sea urchins of the same species. We show that the various hbox12/pmar1/micro1 genes grou…

0301 basic medicine; Evolutionary Genetics; Embryology; Gene regulatory network; lcsh:Medicine; Gene Expression; Medicine (all); Biochemistry Genetics and Molecular Biology (all); Agricultural and Biological Sciences (all); Database and Informatics Methods; Gene duplication; Gene Regulatory Networks; Copy-number variation; lcsh:Science; Sea urchin; Phylogeny; Multidisciplinary; biology; Phylogenetic tree; Medicine (all); Genes Homeobox; Gene Expression Regulation Developmental; Animal Models; Genomics; Experimental Organism Systems; Multigene Family; Sequence Analysis; Research Article; Echinoderms; DNA Copy Number Variations; Bioinformatics; DNA transcription; Zoology; Settore BIO/11 - Biologia Molecolare; Research and Analysis Methods; Paracentrotus lividus; 03 medical and health sciences; Sequence Motif Analysis; biology.animal; Genetics; Gene family; Animals; Gene; Evolutionary Biology; Biochemistry Genetics and Molecular Biology (all); lcsh:R; Embryos; Organisms; Biology and Life Sciences; Computational Biology; biology.organism_classification; Genome Analysis; Genomic Libraries; Invertebrates; 030104 developmental biology; Agricultural and Biological Sciences (all); Evolutionary biology; Sea Urchins; lcsh:Q; Sequence Alignment; Developmental Biology
researchProduct

iDamIDseq and iDEAR: an improved method and computational pipeline to profile chromatin-binding proteins

2016

DNA adenine methyltransferase identification (DamID) has emerged as an alternative method to profile protein-DNA interactions; however, critical issues limit its widespread applicability. Here, we present iDamIDseq, a protocol that improves specificity and sensitivity by inverting the steps DpnI-DpnII and adding steps that involve a phosphatase and exonuclease. To determine genome-wide protein-DNA interactions efficiently, we present the analysis tool iDEAR (iDamIDseq Enrichment Analysis with R). The combination of DamID and iDEAR permits the establishment of consistent profiles for transcription factors, even in transient assays, as we exemplify using the small teleost medaka (Oryzias lati…

0301 basic medicine; Exonuclease; Site-Specific DNA-Methyltransferase (Adenine-Specific); Embryo Nonmammalian; Oryzias; Oryzias; Computational biology; Biology; 03 medical and health sciences; chemistry.chemical_compound; Techniques and Resources; Transcriptional regulation; Databases Genetic; Protein Interaction Mapping; Transcriptional regulation; Animals; Epigenetics; Promoter Regions Genetic; Molecular Biology; Transcription factor; Genetics; Binding Sites; Chromatin binding; Computational Biology; Promoter; Sequence Analysis DNA; DNA Methylation; biology.organism_classification; Chromatin; DNA-Binding Proteins; 030104 developmental biology; chemistry; Gene Expression Regulation; 207; Chromatin profiling; biology.protein; DamID; Epigenetics; Transcription factor; DNA; Algorithms; Developmental Biology; Protein Binding; Transcription Factors; Development (Cambridge, England)
researchProduct

FASTdoop: A versatile and efficient library for the input of FASTA and FASTQ files for MapReduce Hadoop bioinformatics applications

2017

Summary: MapReduce Hadoop bioinformatics applications require the availability of special-purpose routines to manage the input of sequence files. Unfortunately, the Hadoop framework does not provide any built-in support for the most popular sequence file formats like FASTA or BAM. Moreover, the development of these routines is not easy, both because of the diversity of these formats and the need for efficiently managing sequence datasets that may count up to billions of characters. We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files. We show that, with respect to analogous input management routines that have appeared in the literature, it offers…

0301 basic medicine; FASTQ format; Statistics and Probability; Computer science; Sequence analysis; media_common.quotation_subject; Information Storage and Retrieval; Bioinformatics; computer.software_genre; Genome; Biochemistry; Domain (software engineering); 03 medical and health sciences; Computational Theory and Mathematic; Humans; Genomic library; Quality (business); DNA sequencing; FASTQ; NGS; FASTQ; DNA sequencing; Molecular Biology; media_common; Gene Library; Sequence; Database; Settore INF/01 - Informatica; Genome Human; Computer Science Applications1707 Computer Vision and Pattern Recognition; Genomics; Sequence Analysis DNA; FASTQ; File format; Computer Science Applications; Statistics and Probability; Biochemistry; Molecular Biology; Computer Science Applications1707 Computer Vision and Pattern Recognition; Computational Theory and Mathematics; Computational Mathematics; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; NGS; Database Management Systems; computer
researchProduct

Detecting mutations by eBWT

2018

In this paper we develop a theory describing how the extended Burrows-Wheeler Transform (eBWT) of a collection of DNA fragments tends to cluster together the copies of nucleotides sequenced from a genome G. Our theory accurately predicts how many copies of any nucleotide are expected inside each such cluster, and how an elegant and precise LCP-array-based procedure can locate these clusters in the eBWT. Our findings are very general and can be applied to a wide range of different problems. In this paper, we consider the case of alignment-free and reference-free SNP discovery in multiple collections of reads. We note that, in accordance with our theoretical results, SNPs are clustered in th…

0301 basic medicine; FOS: Computer and information sciences; 000 Computer science knowledge general works; BWT LCP Array SNPs Reference-free Assembly-free; LCP Array; Settore INF/01 - Informatica; [SDV]Life Sciences [q-bio]; Reference-free; Assembly-free; SNP; 03 medical and health sciences; 030104 developmental biology; BWT; BWT; LCP Array; SNPs; Reference-free; Assembly-free; Computer Science; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); [INFO]Computer Science [cs]; Software; SNPs
researchProduct

The colored longest common prefix array computed via sequential scans

2018

Due to the increased availability of large datasets of biological sequences, the tools for sequence comparison are now relying on efficient alignment-free approaches to a greater extent. Most of the alignment-free approaches require the computation of statistics of the sequences in the dataset. Such computations become impractical in internal memory when very large collections of long sequences are considered. In this paper, we present a new conceptual data structure, the colored longest common prefix array (cLCP), that allows several problems to be tackled efficiently with an alignment-free approach. In fact, we show that such a data structure can be computed via sequential scans in semi-exter…

0301 basic medicine; FOS: Computer and information sciences; Alignment-free methods; Burrows–Wheeler transform; Computer science; Computation; Average common substring; 0206 medical engineering; Matching statistics; Scale (descriptive set theory); 02 engineering and technology; Theoretical Computer Science; 03 medical and health sciences; Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); Burrows-wheeler transform; String (computer science); Computer Science (all); LCP array; Matching statistic; Data structure; Substring; 030104 developmental biology; Alignment-free methods; Average common substring; Burrows-wheeler transform; Longest common prefix; Matching statistics; Theoretical Computer Science; Computer Science (all); Pairwise comparison; Longest common prefix; Algorithm; 020602 bioinformatics; Alignment-free method
researchProduct

Q-nexus: a comprehensive and efficient analysis pipeline designed for ChIP-nexus

2016

Background: ChIP-nexus, an extension of the ChIP-exo protocol, can be used to map the borders of protein-bound DNA sequences at nucleotide resolution, requires less input DNA and enables selective PCR duplicate removal using random barcodes. However, the use of random barcodes requires additional preprocessing of the mapping data, which complicates the computational analysis. To date, only a very limited number of software packages are available for the analysis of ChIP-exo data, which have not yet been systematically tested and compared on ChIP-nexus data. Results: Here, we present a comprehensive software package for ChIP-nexus data that exploits the random barcodes for selective removal …

0301 basic medicine; FOS: Computer and information sciences; Duplication rates; Chromatin Immunoprecipitation; Bioinformatics; Pipeline (computing); 610; Biology; computer.software_genre; 600 Technik Medizin angewandte Wissenschaften::610 Medizin und Gesundheit; 03 medical and health sciences; Software; ChIP-nexus; Genetics; Preprocessor; Nucleotide Motifs; Library complexity; ChIP-exo; Genetics; Protocol (science); Binding Sites; business.industry; fungi; Computational Biology; High-Throughput Nucleotide Sequencing; Reproducibility of Results; Chip; Chromatin immunoprecipitation; Data mapping; DNA-Binding Proteins; Algorithm; 030104 developmental biology; ChIP-exo; Data mining; business; Peak calling; computer; Algorithms; Software; Protein Binding; Transcription Factors; Research Article; Biotechnology; BMC Genomics
researchProduct