Search results for "Database Management"

showing 10 items of 26 documents

FASTdoop: A versatile and efficient library for the input of FASTA and FASTQ files for MapReduce Hadoop bioinformatics applications

2017

Abstract Summary MapReduce Hadoop bioinformatics applications require the availability of special-purpose routines to manage the input of sequence files. Unfortunately, the Hadoop framework does not provide any built-in support for the most popular sequence file formats like FASTA or BAM. Moreover, the development of these routines is not easy, both because of the diversity of these formats and the need for managing efficiently sequence datasets that may count up to billions of characters. We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files. We show that, with respect to analogous input management routines that have appeared in the Literature, it offers…

0301 basic medicineFASTQ formatStatistics and ProbabilityComputer scienceSequence analysismedia_common.quotation_subjectInformation Storage and RetrievalBioinformaticscomputer.software_genreGenomeBiochemistryDomain (software engineering)03 medical and health sciencesComputational Theory and MathematicHumansGenomic libraryQuality (business)DNA sequencingFASTQ; NGS; FASTQ; DNA sequencingMolecular Biologymedia_commonGene LibrarySequenceDatabaseSettore INF/01 - InformaticaGenome HumanComputer Science Applications1707 Computer Vision and Pattern RecognitionGenomicsSequence Analysis DNAFASTQFile formatComputer Science ApplicationsStatistics and Probability; Biochemistry; Molecular Biology; Computer Science Applications1707 Computer Vision and Pattern Recognition; Computational Theory and Mathematics; Computational MathematicsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsNGSDatabase Management Systemscomputer
researchProduct

BlotBase: a northern blot database.

2008

With the availability of high-throughput gene expression analysis, multiple public expression databases emerged, mostly based on microarray expression data. Although these databases are of significant biomedical value, they do hold significant drawbacks, especially concerning the reliability of single gene expression profiles obtained by microarray data. Simultaneously, reliable data on an individual gene's expression are often published as single northern blots in individual publications. These data were not yet available for high-throughput screening. To reduce the gap between high-throughput expression data and individual highly reliable expression data, we designed a novel database "Blo…

Bar chartHUGO Gene Nomenclature CommitteeValue (computer science)Information Storage and RetrievalBiologycomputer.software_genrePolymerase Chain Reactionlaw.inventionMicelawGeneticsComputer GraphicsMicroarray databasesAnimalsHumansNorthern blotDatabases ProteinDNA PrimersInternetDatabaseMicroarray analysis techniquesSequence Analysis RNAGene Expression ProfilingFull text searchComputational BiologyGeneral MedicineBlotting NorthernGene expression profilingDatabase Management SystemscomputerSoftwareGene
researchProduct

EVpedia: a community web portal for extracellular vesicles research

2014

Abstract Motivation: Extracellular vesicles (EVs) are spherical bilayered proteolipids, harboring various bioactive molecules. Due to the complexity of the vesicular nomenclatures and components, online searches for EV-related publications and vesicular components are currently challenging. Results: We present an improved version of EVpedia, a public database for EVs research. This community web portal contains a database of publications and vesicular components, identification of orthologous vesicular components, bioinformatic tools and a personalized function. EVpedia includes 6879 publications, 172 080 vesicular components from 263 high-throughput datasets, and has been accessed more tha…

Biomedical ResearchDatabases FactualComputer scienceBioactive moleculesMedizinBioinformaticsBiochemistryMathematical SciencesUser-Computer InterfaceNon-U.S. Gov'tdatabasecomputer.programming_languagePLASMAMICROPARTICLESResearch Support Non-U.S. Gov'tbioinformaticsBiological SciencesOriginal PapersCANCERComputer Science ApplicationsIdentification (information)Cell and molecular biologyComputational MathematicsComputational Theory and MathematicsPROTEOMIC ANALYSISMEMBRANE-VESICLESEXPRESSIONStatistics and ProbabilityPROSTASOMESJavaBioinformaticsexosomesResearch SupportExtracellular vesiclesWorld Wide WebDatabasesDELIVERYInformation and Computing SciencesJournal ArticleHumansMembrane vesicleMolecular BiologyFactualEXOSOMESComputational BiologyCELLSDatabase Management SystemsExtracellular SpacecomputerSoftware
researchProduct

CoCoDat: a database system for organizing and selecting quantitative data on single neurons and neuronal microcircuitry.

2004

We present a novel database system for organizing and selecting quantitative experimental data on single neurons and neuronal microcircuitry that has proven useful for reference-keeping, experimental planning and computational modelling. Building on our previous experience with large neuroscientific databases, the system takes into account the diversity and method-dependence of single cell and microcircuitry data and provides tools for entering and retrieving published data without a priori interpretation or summarizing. Data representation is based on the framework suggested by biophysical theory and enables flexible combinations of data on membrane conductances, ionic and synaptic current…

Computer sciencecomputer.internet_protocolRelational databaseModels NeurologicalAction PotentialsInformation Storage and Retrievalcomputer.software_genreMachine learningExternal Data RepresentationData retrievalAnimalsComputer SimulationLayer (object-oriented design)NeuronsDatabasebusiness.industryGeneral NeuroscienceExperimental dataRatsData sharingScalabilityDatabase Management SystemsArtificial intelligenceNeural Networks ComputerNerve NetbusinesscomputerXMLJournal of neuroscience methods
researchProduct

Controlling false match rates in record linkage using extreme value theory

2011

AbstractCleansing data from synonyms and homonyms is a relevant task in fields where high quality of data is crucial, for example in disease registries and medical research networks. Record linkage provides methods for minimizing synonym and homonym errors thereby improving data quality. We focus our attention to the case of homonym errors (in the following denoted as ‘false matches’), in which records belonging to different entities are wrongly classified as equal. Synonym errors (‘false non-matches’) occur when a single entity maps to multiple records in the linkage result. They are not considered in this study because in our application domain they are not as crucial as false matches. Fa…

Data cleansingData cleansingBiomedical ResearchDatabases FactualCalibration (statistics)Computer scienceHealth Informaticscomputer.software_genrePlot (graphics)Mean excess plotStatisticsRegistriesExtreme value theoryLinkage (software)Models StatisticalComputational BiologyFellegi–Sunter modelMixture modelGeneralized Pareto distributionComputer Science ApplicationsData qualityStatistics of extreme valuesDatabase Management SystemsMedical Record LinkageData miningcomputerAlgorithmsMedical InformaticsRecord linkageJournal of Biomedical Informatics
researchProduct

Metadata to Support Data Warehouse Evolution

2009

The focus of this chapter is metadata necessary to support data warehouse evolution. We present the data warehouse framework that is able to track evolution process and adapt data warehouse schemata and data extraction, transformation, and loading (ETL) processes. We discuss the significant part of the framework, the metadata repository that stores information about the data warehouse, logical and physical schemata and their versions. We propose the physical implementation of multiversion data warehouse in a relational DBMS. For each modification of a data warehouse schema, we outline the changes that need to be made to the repository metadata and in the database.

Data elementInformation retrievalDatabaseComputer scienceInformationSystems_DATABASEMANAGEMENTcomputer.software_genreData warehouseMetadata repositorySchema evolutionMetadataRelational database management systemData extractionSchema (psychology)computer
researchProduct

A Rule-Based Multi-agent System for Local Traffic Management

2009

Road Traffic presents a high dynamism which makes necessary the development of traffic management and control strategies to improve traffic flows and more important, road safety. So it is needed the use of intelligent systems to support traffic organizations and road operators to cope with incidents. In this paper we introduce a local autonomous system for traffic management. This way, though there could be a breakdown in the communications between the local system and the TCC, the local system will be able to warn the road users in case of incidents. The system uses multiagent technology to work with the specific characteristics of traffic domain. The expert MAS system is ruled-based that …

Database Managementtraffic management systemComputer sciencebusiness.industryMulti-agent systemControl (management)multiagent systemIntelligent decision support systemRule-based systemVehicle Information and Communication SystemComputer securitycomputer.software_genreAdvanced Traffic Management SystemTraffic engineeringComputerSystemsOrganization_MISCELLANEOUSbusinessAutonomous system (mathematics)computer
researchProduct

Migration of Relational Database to Document-Oriented Database: Structure Denormalization and Data Transformation

2015

Relational databases remain the leading data storage technology. Nevertheless, many companies want to reduce operating expenses, to make scalable applications that use cloud computing technologies. Use of NoSQL database is one of the possible solutions, and it is forecasted that the NoSQL market will be growing at a CAGR of approximately 50 percent over the next five years. The paper offers a solution for quick data migration from a relational database into a document-oriented database. We have created semi-automatically two logical levels over physical data. Users can refine generated logical data model and configure data migration template for each needed document. Data migration features…

DatabaseRelational database management systemComputer scienceRelational databaseViewDatabase schemaDatabase theoryNoSQLcomputer.software_genrecomputerDatabase designDatabase model2015 7th International Conference on Computational Intelligence, Communication Systems and Networks
researchProduct

A completely automated CAD system for mass detection in a large mammographic database.

2006

Mass localization plays a crucial role in computer-aided detection (CAD) systems for the classification of suspicious regions in mammograms. In this article we present a completely automated classification system for the detection of masses in digitized mammographic images. The tool system we discuss consists in three processing levels: (a) Image segmentation for the localization of regions of interest (ROIs). This step relies on an iterative dynamical threshold algorithm able to select iso-intensity closed contours around gray level maxima of the mammogram. (b) ROI characterization by means of textural features computed from the gray tone spatial dependence matrix (GTSDM), containing secon…

Databases FactualInformation Storage and RetrievalReproducibility of ResultsBreast NeoplasmsSensitivity and SpecificityNeural networkPattern Recognition AutomatedRadiographic Image EnhancementBreast cancerTextural featuresRadiology Information SystemsImage processingComputer-aided detection (CAD)Artificial IntelligenceCluster AnalysisDatabase Management SystemsHumansRadiographic Image Interpretation Computer-AssistedFemaleBreast cancer; Computer-aided detection (CAD); Image processing; Mammographic mass detection; Neural network; Textural featuresMammographic mass detectionAlgorithmsMammographyMedical physics
researchProduct

Construction of a webgis tool based on a gis semiautomated processing for the localization of p2g plants in sicily (Italy)

2021

The recent diffusion of RES (Renewable Energy Sources), considering the electric energy produced by photovoltaic and wind plants, brought to light the problem of the unpredictable nature of wind and solar energy. P2G (Power to Gas) implementation seems to be the right solution, transforming curtailed energy in hydrogen. The choice of the settlement of P2G plants is linked to many factors like the distances between the gas grid and the settlement of RES plants, the transportation networks, the energy production, and population distribution. In light of this, the implementation of a Multi-Criteria Analysis (MCA) into a Geographic Information System (GIS) processing represents a good strategy …

Geographic information systemComputer scienceGeography Planning and DevelopmentPopulationcomputer.software_genreAsset (computer security)WebGISRelational database management systemEarth and Planetary Sciences (miscellaneous)P2GComputers in Earth ScienceseducationRDBMSeducation.field_of_studyGeography (General)Databasebusiness.industryPhotovoltaic systemMCAHydrogen technologiesGridRenewable energyRESG1-922businesscomputer
researchProduct