Search results for "Database"

showing 10 items of 2136 documents

Acceleration of short and long DNA read mapping without loss of accuracy using suffix array

2014

HPG Aligner applies suffix arrays for DNA read mapping. This implementation produces a highly sensitive and extremely fast mapping of DNA reads that scales up almost linearly with read length. The approach presented here is faster (over 20× for long reads) and more sensitive (over 98% in a wide range of read lengths) than the current state-of-the-art mappers. HPG Aligner is not only an optimal alternative for current sequencers but also the only solution available to cope with longer reads and growing throughputs produced by forthcoming sequencing technologies.

Statistics and Probability; Computer science; Sequence analysis; Sequence alignment; database searches; computer.software_genre; Biochemistry; law.invention; Acceleration; chemistry.chemical_compound; law; CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL; Animals; Humans; Molecular Biology; Database; sequencing data; Suffix array; Sequence analysis; High-Throughput Nucleotide Sequencing; alignment; Sequence Analysis DNA; Applications Notes; Computer Science Applications; Computational Mathematics; Computational Theory and Mathematics; chemistry; Drosophila; Suffix; Sequence Alignment; computer; Algorithm; Algorithms; Software; DNA
researchProduct

WOODIV, a database of occurrences, functional traits, and phylogenetic data for all Euro-Mediterranean trees

2021

Trees play a key role in the structure and function of many ecosystems worldwide. In the Mediterranean Basin, forests cover approximately 22% of the total land area hosting a large number of endemics (46 species). Despite its particularities and vulnerability, the biodiversity of Mediterranean trees is not well known at the taxonomic, spatial, functional, and genetic levels required for conservation applications. The WOODIV database fills this gap by providing reliable occurrences, four functional traits (plant height, seed mass, wood density, and specific leaf area), and sequences from three DNA-regions (rbcL, matK, and trnH-psbA), together with modelled occurrences and a phylogeny for all…

Statistics and Probability; Data Descriptor; Databases Factual; Mediterranean Region; Conservation biology; Settore BIO/02 - Botanica Sistematica; Science; Q; Biodiversity; Library and Information Sciences; Trees; Computer Science Applications; Education; Biogeography; Settore BIO/03 - Botanica Ambientale E Applicata; [SDE]Environmental Sciences; Forest; Community ecology; Statistics Probability and Uncertainty; Forest ecology; Ecosystem; Phylogeny; Information Systems
researchProduct

Spanish electoral archive. SEA database

2021

This paper introduces the SEA database (acronym for Spanish Electoral Archive). SEA brings together the most complete public repository available to date on Spanish election outcomes. SEA holds all the results recorded from the electoral processes of General (1979–2019), Regional (1989–2021), Local (1979–2019) and European Parliamentary (1987–2019) elections held in Spain since the restoration of democracy in the late 1970s, in addition to other data sets with electoral content. The data are offered for free and are presented in a homogeneous and friendly format. Most of the databases are available for download with data from various electoral levels, including from the ballot box level. This…

Statistics and Probability; Data Descriptor; History; Download; Science; media_common.quotation_subject; 0211 other engineering and technologies; Inference; 02 engineering and technology; Library and Information Sciences; computer.software_genre; 01 natural sciences; Education; 010104 statistics & probability; Sociology; Voting; Political science; Acronym; Society; 0101 mathematics; media_common; Database; Q; Politics; 021107 urban & regional planning; Turnout; Democracy; Computer Science Applications; Metadata; Ballot; Government; Economia Mètodes estadístics; Statistics Probability and Uncertainty; computer; Information Systems; Scientific Data
researchProduct

A database for the monitoring of thermal anomalies over the Amazon forest and adjacent intertropical oceans

2015

Abstract: Advances in information technologies and accessibility to climate and satellite data in recent years have favored the development of web-based tools with user-friendly interfaces in order to facilitate the dissemination of geo/biophysical products. These products are useful for the analysis of the impact of global warming over different biomes. In particular, the study of the Amazon forest responses to drought has recently received attention from the scientific community due to the occurrence of two extreme droughts and sustained warming over the last decade. Thermal Amazoni@ is a web-based platform for the visualization and download of surface thermal anomalies products over the Ama…

Statistics and Probability; Data Descriptor; Rainforest; Databases Factual; Download; Oceans and Seas; Biome; Rainforest; Library and Information Sciences; Global Warming; Education; Effects of global warming; Server; Baseline (configuration management); Global warming; Tropical ecology; Computer Science Applications; Oceanography; Climatology; Environmental science; Satellite; Forest ecology; Statistics Probability and Uncertainty; Climate-change impacts; Software; Information Systems; Scientific Data
researchProduct

Galaxy LIMS for next-generation sequencing.

2013

Abstract Summary: We have developed a laboratory information management system (LIMS) for a next-generation sequencing (NGS) laboratory within the existing Galaxy platform. The system provides lab technicians with standard and customizable sample information forms, barcoded submission forms, tracking of input sample quality, multiplex-capable automatic flow cell design and automatically generated sample sheets to aid physical flow cell preparation. In addition, the platform provides the researcher with a user-friendly interface to create a request, submit accompanying samples, upload sample quality measurements and access the sequencing results. As the LIMS is within the Galaxy platform, the …

Statistics and Probability; Database; business.industry; Computer science; Sample (material); Interface (computing); High-Throughput Nucleotide Sequencing; computer.software_genre; Biochemistry; DNA sequencing; Computer Science Applications; Workflow; World Wide Web; Computational Mathematics; User-Computer Interface; Software; Computational Theory and Mathematics; business; Molecular Biology; computer; Software; Information Systems; Bioinformatics (Oxford, England)
researchProduct

Textual data compression in computational biology: a synopsis.

2009

Abstract Motivation: Textual data compression, and the associated techniques coming from information theory, are often perceived as being of interest for data communication and storage. However, they are also deeply related to classification and data mining and analysis. In recent years, a substantial effort has been made for the application of textual data compression techniques to various computational biology tasks, ranging from storage and indexing of large datasets to comparison and reverse engineering of biological networks. Results: The main focus of this review is on a systematic presentation of the key areas of bioinformatics and computational biology where compression has been use…

Statistics and Probability; Databases Factual; business.industry; Computer science; media_common.quotation_subject; Search engine indexing; compression data; Computational Biology; Information Storage and Retrieval; Computational biology; Biochemistry; Data science; Computer Science Applications; Computational Mathematics; Presentation; Software; Computational Theory and Mathematics; Benchmark (computing); business; Molecular Biology; Biological network; Software; Data compression; media_common; Bioinformatics (Oxford, England)
researchProduct

The conditional censored graphical lasso estimator

2020

© 2020, Springer Science+Business Media, LLC, part of Springer Nature. In many applied fields, such as genomics, different types of data are collected on the same system, and it is not uncommon that some of these datasets are subject to censoring as a result of the measurement technologies used, such as data generated by polymerase chain reactions and flow cytometers. When the overall objective is that of network inference, at possibly different levels of a system, information coming from different sources and/or different steps of the analysis can be integrated into one model with the use of conditional graphical models. In this paper, we develop a doubly penalized inferential procedure for…

Statistics and Probability; FOS: Computer and information sciences; Computer science; Gaussian; Inference; Data type; Theoretical Computer Science; high-dimensional setting; Database normalization; Methodology (stat.ME); symbols.namesake; Lasso (statistics); Graphical model; Conditional Gaussian graphical model; censored graphical lasso; Statistics - Methodology; High-dimensional setting; conditional Gaussian graphical models; sparsity; Estimator; Censoring (statistics); Censored graphical lasso; Computational Theory and Mathematics; symbols; Censored data; Statistics Probability and Uncertainty; Settore SECS-S/01 - Statistica; Sparsity; Algorithm
researchProduct

Comparative Evaluation of Community Detection Algorithms: A Topological Approach

2012

Community detection is one of the most active fields in complex networks analysis, due to its potential value in practical applications. Many studies inspired by different paradigms are devoted to the development of algorithmic solutions that reveal the network structure in such cohesive subgroups. Comparative studies reported in the literature usually rely on a performance measure considering the community structure as a partition (Rand Index, Normalized Mutual Information, etc.). However, this type of comparison neglects the topological properties of the communities. In this article, we present a comprehensive comparative study of a representative set of commu…

Statistics and Probability; FOS: Computer and information sciences; Physics - Physics and Society; Computer science; [INFO.INFO-OH]Computer Science [cs]/Other [cs.OH]; Rand index; FOS: Physical sciences; 02 engineering and technology; Physics and Society (physics.soc-ph); Topology; 01 natural sciences; Measure (mathematics); 010305 fluids & plasmas; Set (abstract data type); Development (topology); 0103 physical sciences; 0202 electrical engineering electronic engineering information engineering; Equivalence (measure theory); Random graph; Social and Information Networks (cs.SI); Computer Science - Social and Information Networks; Statistical and Nonlinear Physics; Network dynamics; Partition (database); [ INFO.INFO-OH ] Computer Science [cs]/Other [cs.OH]; 020201 artificial intelligence & image processing; Statistics Probability and Uncertainty
researchProduct

Updating input–output matrices: assessing alternatives through simulation

2009

A problem that frequently arises in economics, demography, statistics, transportation planning and stochastic modelling is how to adjust the entries of a matrix to fulfil row and column aggregation constraints. Biproportional methods in general and the so-called RAS algorithm in particular, have been used for decades to find solutions to this type of problem. Although alternatives exist, the RAS algorithm and its extensions are still the most popular. Apart from some interesting empirical and theoretical properties, tradition, simplicity and very low computational costs are among the reasons behind the great success of RAS. Nowadays computer hardware and software have made alternative proce…

Statistics and Probability; Input/output; Transportation planning; Mathematical optimization; Iterative proportional fitting; business.industry; Stochastic modelling; Applied Mathematics; media_common.quotation_subject; Column (database); Matrix (mathematics); Software; Modeling and Simulation; Simplicity; Statistics Probability and Uncertainty; business; Mathematics; media_common; Journal of Statistical Computation and Simulation
researchProduct

Assessment of the probabilities for evolutionary structural changes in protein folds.

2007

Abstract Motivation: The evolution of protein sequences can be described by a stepwise process, where each step involves changes of a few amino acids. In a similar manner, the evolution of protein folds can be at least partially described by an analogous process, where each step involves comparatively simple changes affecting few secondary structure elements. A number of such evolution steps, justified by biologically confirmed examples, have previously been proposed by other researchers. However, unlike the situation with sequences, as far as we know there have been no attempts to estimate the comparative probabilities for different kinds of such structural changes. Results: We have tried …

Statistics and Probability; Models Molecular; Protein Folding; Protein domain; Structural alignment; Biology; Biochemistry; Set (abstract data type); Evolution Molecular; Protein structure; Similarity (network science); Sequence Analysis Protein; Computer Simulation; Molecular Biology; Protein secondary structure; Conserved Sequence; Sequence; Models Genetic; Sequence Homology Amino Acid; Proteins; Structural Classification of Proteins database; Computer Science Applications; Computational Mathematics; Computational Theory and Mathematics; Models Chemical; Data Interpretation Statistical; sense organs; Algorithm; Sequence Alignment; Bioinformatics (Oxford, England)
researchProduct