Search results for "Database Management Systems"

showing 10 items of 15 documents

FASTdoop: A versatile and efficient library for the input of FASTA and FASTQ files for MapReduce Hadoop bioinformatics applications

2017

Abstract Summary MapReduce Hadoop bioinformatics applications require the availability of special-purpose routines to manage the input of sequence files. Unfortunately, the Hadoop framework does not provide any built-in support for the most popular sequence file formats like FASTA or BAM. Moreover, the development of these routines is not easy, both because of the diversity of these formats and the need for managing efficiently sequence datasets that may count up to billions of characters. We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files. We show that, with respect to analogous input management routines that have appeared in the Literature, it offers…

0301 basic medicineFASTQ formatStatistics and ProbabilityComputer scienceSequence analysismedia_common.quotation_subjectInformation Storage and RetrievalBioinformaticscomputer.software_genreGenomeBiochemistryDomain (software engineering)03 medical and health sciencesComputational Theory and MathematicHumansGenomic libraryQuality (business)DNA sequencingFASTQ; NGS; FASTQ; DNA sequencingMolecular Biologymedia_commonGene LibrarySequenceDatabaseSettore INF/01 - InformaticaGenome HumanComputer Science Applications1707 Computer Vision and Pattern RecognitionGenomicsSequence Analysis DNAFASTQFile formatComputer Science ApplicationsStatistics and Probability; Biochemistry; Molecular Biology; Computer Science Applications1707 Computer Vision and Pattern Recognition; Computational Theory and Mathematics; Computational MathematicsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsNGSDatabase Management Systemscomputer
researchProduct

BlotBase: a northern blot database.

2008

With the availability of high-throughput gene expression analysis, multiple public expression databases emerged, mostly based on microarray expression data. Although these databases are of significant biomedical value, they do hold significant drawbacks, especially concerning the reliability of single gene expression profiles obtained by microarray data. Simultaneously, reliable data on an individual gene's expression are often published as single northern blots in individual publications. These data were not yet available for high-throughput screening. To reduce the gap between high-throughput expression data and individual highly reliable expression data, we designed a novel database "Blo…

Bar chartHUGO Gene Nomenclature CommitteeValue (computer science)Information Storage and RetrievalBiologycomputer.software_genrePolymerase Chain Reactionlaw.inventionMicelawGeneticsComputer GraphicsMicroarray databasesAnimalsHumansNorthern blotDatabases ProteinDNA PrimersInternetDatabaseMicroarray analysis techniquesSequence Analysis RNAGene Expression ProfilingFull text searchComputational BiologyGeneral MedicineBlotting NorthernGene expression profilingDatabase Management SystemscomputerSoftwareGene
researchProduct

EVpedia: a community web portal for extracellular vesicles research

2014

Abstract Motivation: Extracellular vesicles (EVs) are spherical bilayered proteolipids, harboring various bioactive molecules. Due to the complexity of the vesicular nomenclatures and components, online searches for EV-related publications and vesicular components are currently challenging. Results: We present an improved version of EVpedia, a public database for EVs research. This community web portal contains a database of publications and vesicular components, identification of orthologous vesicular components, bioinformatic tools and a personalized function. EVpedia includes 6879 publications, 172 080 vesicular components from 263 high-throughput datasets, and has been accessed more tha…

Biomedical ResearchDatabases FactualComputer scienceBioactive moleculesMedizinBioinformaticsBiochemistryMathematical SciencesUser-Computer InterfaceNon-U.S. Gov'tdatabasecomputer.programming_languagePLASMAMICROPARTICLESResearch Support Non-U.S. Gov'tbioinformaticsBiological SciencesOriginal PapersCANCERComputer Science ApplicationsIdentification (information)Cell and molecular biologyComputational MathematicsComputational Theory and MathematicsPROTEOMIC ANALYSISMEMBRANE-VESICLESEXPRESSIONStatistics and ProbabilityPROSTASOMESJavaBioinformaticsexosomesResearch SupportExtracellular vesiclesWorld Wide WebDatabasesDELIVERYInformation and Computing SciencesJournal ArticleHumansMembrane vesicleMolecular BiologyFactualEXOSOMESComputational BiologyCELLSDatabase Management SystemsExtracellular SpacecomputerSoftware
researchProduct

CoCoDat: a database system for organizing and selecting quantitative data on single neurons and neuronal microcircuitry.

2004

We present a novel database system for organizing and selecting quantitative experimental data on single neurons and neuronal microcircuitry that has proven useful for reference-keeping, experimental planning and computational modelling. Building on our previous experience with large neuroscientific databases, the system takes into account the diversity and method-dependence of single cell and microcircuitry data and provides tools for entering and retrieving published data without a priori interpretation or summarizing. Data representation is based on the framework suggested by biophysical theory and enables flexible combinations of data on membrane conductances, ionic and synaptic current…

Computer sciencecomputer.internet_protocolRelational databaseModels NeurologicalAction PotentialsInformation Storage and Retrievalcomputer.software_genreMachine learningExternal Data RepresentationData retrievalAnimalsComputer SimulationLayer (object-oriented design)NeuronsDatabasebusiness.industryGeneral NeuroscienceExperimental dataRatsData sharingScalabilityDatabase Management SystemsArtificial intelligenceNeural Networks ComputerNerve NetbusinesscomputerXMLJournal of neuroscience methods
researchProduct

Controlling false match rates in record linkage using extreme value theory

2011

AbstractCleansing data from synonyms and homonyms is a relevant task in fields where high quality of data is crucial, for example in disease registries and medical research networks. Record linkage provides methods for minimizing synonym and homonym errors thereby improving data quality. We focus our attention to the case of homonym errors (in the following denoted as ‘false matches’), in which records belonging to different entities are wrongly classified as equal. Synonym errors (‘false non-matches’) occur when a single entity maps to multiple records in the linkage result. They are not considered in this study because in our application domain they are not as crucial as false matches. Fa…

Data cleansingData cleansingBiomedical ResearchDatabases FactualCalibration (statistics)Computer scienceHealth Informaticscomputer.software_genrePlot (graphics)Mean excess plotStatisticsRegistriesExtreme value theoryLinkage (software)Models StatisticalComputational BiologyFellegi–Sunter modelMixture modelGeneralized Pareto distributionComputer Science ApplicationsData qualityStatistics of extreme valuesDatabase Management SystemsMedical Record LinkageData miningcomputerAlgorithmsMedical InformaticsRecord linkageJournal of Biomedical Informatics
researchProduct

A completely automated CAD system for mass detection in a large mammographic database.

2006

Mass localization plays a crucial role in computer-aided detection (CAD) systems for the classification of suspicious regions in mammograms. In this article we present a completely automated classification system for the detection of masses in digitized mammographic images. The tool system we discuss consists in three processing levels: (a) Image segmentation for the localization of regions of interest (ROIs). This step relies on an iterative dynamical threshold algorithm able to select iso-intensity closed contours around gray level maxima of the mammogram. (b) ROI characterization by means of textural features computed from the gray tone spatial dependence matrix (GTSDM), containing secon…

Databases FactualInformation Storage and RetrievalReproducibility of ResultsBreast NeoplasmsSensitivity and SpecificityNeural networkPattern Recognition AutomatedRadiographic Image EnhancementBreast cancerTextural featuresRadiology Information SystemsImage processingComputer-aided detection (CAD)Artificial IntelligenceCluster AnalysisDatabase Management SystemsHumansRadiographic Image Interpretation Computer-AssistedFemaleBreast cancer; Computer-aided detection (CAD); Image processing; Mammographic mass detection; Neural network; Textural featuresMammographic mass detectionAlgorithmsMammographyMedical physics
researchProduct

E-health progresses in Romania

2006

The paper is presenting the recent evolution of e-health aspects in Romania. Data presented are based on governmental reports. Surveys organized by the "Lucian Blaga" University of Sibiu and studies carried on by the national Institute for Research and Development in Informatics (I.C.I.) have shown that Romania has important health problems, from cardio vascular diseases (CVD) to cancer and infectious diseases, a high score on mortality and morbidity and a low one on natality. Poor management of the health sector did not help to solve all these problems. In the last 14 years there were several attempts to reform healthcare but none succeeded until now. The health insurance system is operati…

HRHISmedicine.medical_specialtyMedical Records Systems ComputerizedRomaniabusiness.industryPublic healthHealth Care SectorInternational healthHealth InformaticsPublic relationsClinical decision support systemHealth informaticsTelemedicineHealth promotionNursingHealth careHospital Information SystemsmedicineDatabase Management SystemsbusinessDelivery of Health Carehealth care economics and organizationsHealth policyForecastingInternational Journal of Medical Informatics
researchProduct

PASSIM – an open source software system for managing information in biomedical studies

2007

Abstract Background One of the crucial aspects of day-to-day laboratory information management is collection, storage and retrieval of information about research subjects and biomedical samples. An efficient link between sample data and experiment results is absolutely imperative for a successful outcome of a biomedical study. Currently available software solutions are largely limited to large-scale, expensive commercial Laboratory Information Management Systems (LIMS). Acquiring such LIMS indeed can bring laboratory information management to a higher level, but often implies sufficient investment of time, effort and funds, which are not always available. There is a clear need for lightweig…

Information managementBiomedical ResearchDatabases FactualMedical Records Systems ComputerizedComputer scienceBiomedical EngineeringInformation Storage and RetrievalSample (statistics)lcsh:Computer applications to medicine. Medical informaticsBiochemistryWorld Wide WebUser-Computer InterfaceDocumentationSoftwareArtificial IntelligenceStructural BiologyConfidentialitylcsh:QH301-705.5Molecular BiologyClinical Trials as Topicbusiness.industryApplied MathematicsSubject (documents)Computer Science ApplicationsManagement information systemslcsh:Biology (General)Database Management Systemslcsh:R858-859.7Programming LanguagesUser interfacebusinessSoftware
researchProduct

Distributed image retrieval on DAISY

2006

The paper describes an application of image retrieval based on DAISY architecture (distributed architecture for intelligent system). The creation of pictorial indexes may require a number of hours depending on the size of the pictorial data base. The problem can become more complex in the case of distributed database systems. In both cases a distributed architecture can be the natural and more efficient solution. DAISY architecture is based on the concept of co-operating behavioral agents supervised by a central engagement module. Preliminary experiments, to evaluate the performance of the system, have been performed on a astronomical database and coral image

Information retrievalSettore INF/01 - InformaticaDistributed databaseComputer scienceDistributed database management systemsMulti-agent systemArchitectureDistributed systems image retrievalBase (topology)Image retrievalImage (mathematics)Database index2003 IEEE International Workshop on Computer Architectures for Machine Perception
researchProduct

epiPATH: an information system for the storage and management of molecular epidemiology data from infectious pathogens.

2007

Abstract Background Most research scientists working in the fields of molecular epidemiology, population and evolutionary genetics are confronted with the management of large volumes of data. Moreover, the data used in studies of infectious diseases are complex and usually derive from different institutions such as hospitals or laboratories. Since no public database scheme incorporating clinical and epidemiological information about patients and molecular information about pathogens is currently available, we have developed an information system, composed by a main database and a web-based interface, which integrates both types of data and satisfies requirements of good organization, simple…

Interface (computing)PopulationData securityBiologycomputer.software_genreBioinformaticsWork relatedCommunicable Diseaseslcsh:Infectious and parasitic diseasesRelational database management systemDatabases GeneticInformation systemHumanslcsh:RC109-216RegistrieseducationDatabase servereducation.field_of_studyInternetMolecular EpidemiologyDatabase schemaData scienceInfectious DiseasesDatabase Management SystemscomputerSoftwareBMC infectious diseases
researchProduct