Search results for "Informatics"

showing 10 items of 2542 documents

Analyzing big datasets of genomic sequences: fast and scalable collection of k-mer statistics

2019

Abstract Background Distributed approaches based on the MapReduce programming paradigm have started to be proposed in the Bioinformatics domain, due to the large amount of data produced by the next-generation sequencing techniques. However, the use of MapReduce and related Big Data technologies and frameworks (e.g., Apache Hadoop and Spark) does not necessarily produce satisfactory results, in terms of both efficiency and effectiveness. We discuss how the development of distributed and Big Data management technologies has affected the analysis of large datasets of biological sequences. Moreover, we show how the choice of different parameter configurations and the careful engineering of the …

Data AnalysisFOS: Computer and information sciencesTime FactorsTime FactorComputer scienceStatistics as TopicBig dataApache Spark; distributed computing; performance evaluation; k-mer countinglcsh:Computer applications to medicine. Medical informaticsBiochemistryDomain (software engineering)Databases03 medical and health sciences0302 clinical medicineStructural BiologyComputer clusterStatisticsSpark (mathematics)Molecular Biologylcsh:QH301-705.5030304 developmental biology0303 health sciencesGenomeSettore INF/01 - InformaticaBase SequenceNucleic AcidApache Sparkbusiness.industryResearchApache Spark; Distributed computing; k-mer counting; Performance evaluation; Algorithms; Base Sequence; Software; Time Factors; Data Analysis; Databases Nucleic Acid; Genome; Statistics as TopicApplied Mathematicsk-mer countingDistributed computingComputer Science ApplicationsAlgorithmData AnalysiComputer Science - Distributed Parallel and Cluster Computinglcsh:Biology (General)030220 oncology & carcinogenesisScalabilityPerformance evaluationlcsh:R858-859.7Algorithm designDistributed Parallel and Cluster Computing (cs.DC)Databases Nucleic AcidbusinessAlgorithmsSoftware

researchProduct

Modeling crowd dynamics through coarse-grained data analysis

2018

International audience; Understanding and predicting the collective behaviour of crowds is essential to improve the efficiency of pedestrian flows in urban areas and minimize the risks of accidents at mass events. We advocate for the development of crowd traffic management systems, whereby observations of crowds can be coupled to fast and reliable models to produce rapid predictions of the crowd movement and eventually help crowd managers choose between tailored optimization strategies. Here, we propose a Bi-directional Macroscopic (BM) model as the core of such a system. Its key input is the fundamental diagram for bi-directional flows, i.e. the relation between the pedestrian fluxes and d…

Data AnalysisOperations researchComputer scienceFLOW[INFO.INFO-GR] Computer Science [cs]/Graphics [cs.GR]macroscopic model0904 Chemical EngineeringTransportation02 engineering and technologycomputer.software_genre01 natural sciences010305 fluids & plasmas[SHS]Humanities and Social Sciences[SCCO]Cognitive scienceCrowds0903 Biomedical Engineering0102 Applied Mathematics11. Sustainability0202 electrical engineering electronic engineering information engineeringCluster AnalysisApplied Mathematicsbi-directional fluxcollective behaviourGeneral Medicine[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR]Computational MathematicsCore (game theory)Modeling and Simulation[SCCO.PSYC]Cognitive science/Psychology020201 artificial intelligence & image processingGeneral Agricultural and Biological SciencesLife Sciences & BiomedicineBEHAVIORCrowd dynamicsRelation (database)Bioinformatics[MATH.MATH-DS]Mathematics [math]/Dynamical Systems [math.DS]BioengineeringPedestrianModels PsychologicalMachine learningAdvanced Traffic Management SystemPedestrian traffic0103 physical sciencesHumansComputer Simulation[NLIN.NLIN-AO]Nonlinear Sciences [physics]/Adaptation and Self-Organizing Systems [nlin.AO]Block (data storage)Science & Technologybusiness.industryMathematical ConceptsSIMULATIONSdata-based modelingCrowdingKey (cryptography)Artificial intelligenceMathematical & Computational Biologybusinesscomputer

researchProduct

Controlling false match rates in record linkage using extreme value theory

2011

AbstractCleansing data from synonyms and homonyms is a relevant task in fields where high quality of data is crucial, for example in disease registries and medical research networks. Record linkage provides methods for minimizing synonym and homonym errors thereby improving data quality. We focus our attention to the case of homonym errors (in the following denoted as ‘false matches’), in which records belonging to different entities are wrongly classified as equal. Synonym errors (‘false non-matches’) occur when a single entity maps to multiple records in the linkage result. They are not considered in this study because in our application domain they are not as crucial as false matches. Fa…

Data cleansingData cleansingBiomedical ResearchDatabases FactualCalibration (statistics)Computer scienceHealth Informaticscomputer.software_genrePlot (graphics)Mean excess plotStatisticsRegistriesExtreme value theoryLinkage (software)Models StatisticalComputational BiologyFellegi–Sunter modelMixture modelGeneralized Pareto distributionComputer Science ApplicationsData qualityStatistics of extreme valuesDatabase Management SystemsMedical Record LinkageData miningcomputerAlgorithmsMedical InformaticsRecord linkageJournal of Biomedical Informatics

researchProduct

EHRtemporalVariability

2020

Functions to delineate temporal dataset shifts in Electronic Health Records through the projection and visualization of dissimilarities among data temporal batches. This is done through the estimation of data statistical distributions over time and their projection in non-parametric statistical manifolds, uncovering the patterns of the data latent temporal variability. EHRtemporalVariability is particularly suitable for multi-modal data and categorical variables with a high number of values, common features of biomedical data where traditional statistical process control or time-series methods may not be appropriate. EHRtemporalVariability allows you to explore and identify dataset shifts t…

Data quality managementMedical informaticsBioinformaticsMachine learningData visualisation

researchProduct

VegItaly: Technical features, crucial issues and some solutions

2012

VegItaly is at present the largest Italian vegetation database. It is the result of a collaborative project aspiring to represent a major reference for the Italian vegetation scientists. The paper emphasizes its benefits for phytosociological data management and describes the solutions adopted to solve several technical problems, like the treatment of different vegetation stratification systems, the conversion of vegetation cover values, taxonomic and syntaxonomic issues, data import and access. The structure of the taxonomic list produced to support the storing of data is described. It allows an easy management of synonymic relationships and is constantly updated according to new publicati…

Data sharing; Ecoinformatics; Italian vegetation database; Phytosociology; Syntaxonomy; Taxonomic listVegetation plot; Forestry; Ecology Evolution Behavior and Systematics; Ecology; Plant ScienceEcologyEvolutionDATABASEdata sharingItalian vegetation databasephytosociologyForestrydata sharing ecoinformatics Italian vegetation database phytosociology syntaxonomy taxonomic list vegetation plotPlant Scienceecoinformaticstaxonomic listBehavior and Systematicsvegetation sciencedata sharing; ecoinformatics; italian vegetation database; phytosociology; syntaxonomy; taxonomic list; taxonomic listvegetation plot; vegetation plotSettore BIO/03 - Botanica Ambientale E Applicatasyntaxonomytaxonomic listvegetation plotvegetation plot

researchProduct

Technology-Supported Guidance Models Stimulating the Development of Critical Thinking in Clinical Practice: Protocol for a Mixed Methods Systematic R…

2020

BackgroundCritical thinking is an essential skill that nursing students need to develop. Technological tools have opened new avenues for technology-supported guidance models, but the challenges and facilitators of such guidance models, as well as how they stimulate the development of critical thinking, remain unclear.ObjectiveWe developed a protocol for a mixed methods systematic review to investigate the use of technology-supported guidance models that stimulate the development of critical thinking in nursing education clinical practice.MethodsA convergent integrated design following the Joanna Briggs Institute Manual for Evidence Synthesis will be employed. A pair of authors will select t…

Data transformationComputer applications to medicine. Medical informaticsguidance modelsR858-859.703 medical and health sciences0302 clinical medicineProtocolcritical thinking030212 general & internal medicineNurse educationProtocol (science)Medical education030504 nursingClinical study designnursing educationRGeneral Medicineclinical practiceClinical PracticeCritical thinkingData extractiontechnologyMedicine0305 other medical sciencePsychologyEvidence synthesisJMIR Research Protocols

researchProduct

Assessing the format and content of journal published and non-journal published rapid review reports: A comparative study

2020

Background As production of rapid reviews (RRs) increases in healthcare, knowing how to efficiently convey RR evidence to various end-users is important given they are often intended to directly inform decision-making. Little is known about how often RRs are produced in the published or unpublished domains, and what and how information is structured. Objectives To compare and contrast report format and content features of journal-published (JP) and non-journal published (NJP) RRs. Methods JP RRs were identified from key databases, and NJP RRs were identified from a grey literature search of 148 RR producing organizations and were sampled proportionate to cluster size by organization and pro…

Databases Factual ; Peer Review Research ; PublishingMedical JournalsStructural EngineeringDatabases FactualSciencePeer ReviewDecision MakingMEDLINESocial SciencesResearch and Analysis Methods03 medical and health sciencesDatabase and Informatics Methods0302 clinical medicineCognitionMedicine and Health SciencesPsychology030212 general & internal medicineDatabase SearchingScientific PublishingLanguagePeer Review ResearchPublishingMultidisciplinaryInformation retrievalExecutive summaryHealth Care Policy030503 health policy & servicesQRCognitive PsychologyBiology and Life SciencesGrey literatureResearch AssessmentBuilt StructuresFull ProtocolHealth CareCluster sizeMedicineCognitive ScienceEngineering and Technology0305 other medical sciencePsychologyMedical HumanitiesResearch ArticleNeurosciencePLoS ONE

researchProduct

PyCellBase, an efficient python package for easy retrieval of biological data from heterogeneous sources.

2019

Background Biological databases and repositories are incrementing in diversity and complexity over the years. This rapid expansion of current and new sources of biological knowledge raises serious problems of data accessibility and integration. To handle the growing necessity of unification, CellBase was created as an integrative solution. CellBase provides a centralized NoSQL database containing biological information from different and heterogeneous sources. Access to this information is done through a RESTful web service API, which provides an efficient interface to the data. Results In this work we present PyCellBase, a Python package that provides programmatic access to the rich RESTfu…

Databases FactualComputer scienceAnnotationBiological databaseRESTfulcomputer.software_genreNoSQLlcsh:Computer applications to medicine. Medical informaticsBiochemistryDatabase03 medical and health sciencesAnnotationUser-Computer Interface0302 clinical medicineInstallationStructural BiologyVariantMolecular Biologylcsh:QH301-705.5030304 developmental biologycomputer.programming_language0303 health sciencesBiological dataDatabaseApplied MathematicsRepositoryComputational BiologyPython (programming language)CellBaseComputer Science Applicationslcsh:Biology (General)Scripting language030220 oncology & carcinogenesislcsh:R858-859.7Web servicecomputerSoftwarePython

researchProduct

Analysis of Lipid Experiments (ALEX): A Software Framework for Analysis of High-Resolution Shotgun Lipidomics Data

2013

Global lipidomics analysis across large sample sizes produces high-content datasets that require dedicated software tools supporting lipid identification and quantification, efficient data management and lipidome visualization. Here we present a novel software-based platform for streamlined data processing, management and visualization of shotgun lipidomics data acquired using high-resolution Orbitrap mass spectrometry. The platform features the ALEX framework designed for automated identification and export of lipid species intensity directly from proprietary mass spectral data files, and an auxiliary workflow using database exploration tools for integration of sample information, computat…

Databases FactualComputer scienceData managementlcsh:MedicineBioinformaticscomputer.software_genreMass spectrometryMiceUser-Computer InterfaceData visualizationLipidomicsAnimalslcsh:ScienceInternetMultidisciplinarybusiness.industrylcsh:RBrainLipid-phosphate phosphataseShotgun lipidomicsLipidomeLipidsVisualizationSoftware frameworkKnockout mouselcsh:QData miningbusinesscomputerSoftwareResearch ArticlePLoS ONE

researchProduct

Convolutional Neural Network With Shape Prior Applied to Cardiac MRI Segmentation.

2019

In this paper, we present a novel convolutional neural network architecture to segment images from a series of short-axis cardiac magnetic resonance slices (CMRI). The proposed model is an extension of the U-net that embeds a cardiac shape prior and involves a loss function tailored to the cardiac anatomy. Since the shape prior is computed offline only once, the execution of our model is not limited by its calculation. Our system takes as input raw magnetic resonance images, requires no manual preprocessing or image cropping and is trained to segment the endocardium and epicardium of the left ventricle, the endocardium of the right ventricle, as well as the center of the left ventricle. Wit…

Databases FactualComputer scienceHealth InformaticsImage processingConvolutional neural network030218 nuclear medicine & medical imaging03 medical and health sciences0302 clinical medicineHealth Information ManagementSørensen–Dice coefficientImage Processing Computer-AssistedHumansElectrical and Electronic EngineeringArtificial neural networkbusiness.industryMedical image computingCenter (category theory)Pattern recognitionHeartImage segmentationMagnetic Resonance ImagingComputer Science ApplicationsCardiac Imaging TechniquesHausdorff distancecardiovascular systemArtificial intelligenceNeural Networks Computerbusiness030217 neurology & neurosurgeryIEEE journal of biomedical and health informatics

researchProduct