Search results for "data structure"

showing 10 items of 441 documents

Parallel Construction and Query of Index Data Structures for Pattern Matching on Square Matrices

1999

AbstractWe describe fast parallel algorithms for building index data structures that can be used to gather various statistics on square matrices. The main data structure is the Lsuffix tree, which is a generalization of the classical suffix tree for strings. Given ann×ntext matrixA, we build our data structures inO(logn) time withn2processors on a CRCW PRAM, so that we can quickly processAin parallel as follows: (i) report some statistical information aboutA, e.g., find the largest repeated square submatrices that appear at least twice inAor determine, for each position inA, the smallest submatrix that occurs only there; (ii) given, on-line, anm×mpattern matrixPAT, check whether it occurs i…

Statistics and ProbabilityNumerical AnalysisControl and OptimizationAlgebra and Number TheoryApplied MathematicsGeneral MathematicsSuffix treeParallel algorithmData structureSquare matrixSquare (algebra)law.inventionTree (data structure)lawPattern matchingAlgorithmMathematicsData compressionJournal of Complexity
researchProduct

kmcEx: memory-frugal and retrieval-efficient encoding of counted k-mers.

2018

Abstract Motivation K-mers along with their frequency have served as an elementary building block for error correction, repeat detection, multiple sequence alignment, genome assembly, etc., attracting intensive studies in k-mer counting. However, the output of k-mer counters itself is large; very often, it is too large to fit into main memory, leading to highly narrowed usability. Results We introduce a novel idea of encoding k-mers as well as their frequency, achieving good memory saving and retrieval efficiency. Specifically, we propose a Bloom filter-like data structure to encode counted k-mers by coupled-bit arrays—one for k-mer representation and the other for frequency encoding. Exper…

Statistics and ProbabilitySource codeComputer sciencemedia_common.quotation_subject0206 medical engineeringHash function02 engineering and technologyBiochemistry03 medical and health sciencesEncoding (memory)Molecular BiologyTime complexity030304 developmental biologyBlock (data storage)media_common0303 health sciencesSequence Analysis DNAData structureComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsError detection and correctionAlgorithmSequence Alignment020602 bioinformaticsAlgorithmsSoftwareBioinformatics (Oxford, England)
researchProduct

Building up adjusted indicators of students' evaluation of university courses using generalized item response models

2012

This article advances a proposal for building up adjusted composite indicators of the quality of university courses from students’ assessments. The flexible framework of Generalized Item Response Models is adopted here for controlling the sources of heterogeneity in the data structure that make evaluations across courses not directly comparable. Specifically, it allows us to: jointly model students’ ratings to the set of items which define the quality of university courses; explicitly consider the dimensionality of the items composing the evaluation form; evaluate and remove the effect of potential confounding factors which may affect students’ evaluation; model the intra-cluster variabilit…

Statistics and ProbabilityStructure (mathematical logic)Computer sciencemedia_common.quotation_subjectadjusted indicators explanatory item response models multidimensional latent traits multilevel models evaluation of university courses potential confounding factorsRegression analysisData structureAffect (psychology)Multilevel dataComputingMilieux_COMPUTERSANDEDUCATIONEconometricsMathematics educationQuality (business)Settore SECS-S/05 - Statistica SocialeStatistics Probability and UncertaintySet (psychology)Settore SECS-S/01 - Statisticamedia_commonCurse of dimensionality
researchProduct

Contributed discussion on article by Pratola

2016

The author should be commended for his outstanding contribution to the literature on Bayesian regression tree models. The author introduces three innovative sampling approaches which allow for efficient traversal of the model space. In this response, we add a fourth alternative.

Statistics and Probabilitymodel selectionMarkov Chain Monte Carlo (MCMC)Bayesian regression treeComputer scienceBig dataBayesian regression tree (BRT) modelsComputingMilieux_LEGALASPECTSOFCOMPUTINGbirth–death processMachine learningcomputer.software_genreSequential Monte Carlo methods01 natural sciencespopulation Markov chain Monte Carlo010104 statistics & probabilitysymbols.namesakebig data0502 economics and businessBayesian Regression Trees (BART)0101 mathematics050205 econometrics Bayesian treed regressionMultiple Try Metropolis algorithmsINFERÊNCIA ESTATÍSTICAbusiness.industryApplied MathematicsModel selection05 social sciencesRejection samplingData scienceVariable-order Bayesian networkTree (data structure)Tree traversalMarkov chain Monte Carlocontinuous time Markov processsymbolsArtificial intelligencebusinessBayesian linear regressioncommunication-freecomputerGibbs samplingBayesian Analysis
researchProduct

Recent applications of point process methods in forestry statistics

2000

Forestry statistics is an important field of applied statistics with a long tradition. Many forestry problems can be solved by means of point processes or marked point processes. There, the "points" are tree locations and the "marks" are tree characteristics such as diameter at breast height or degree of damage by environmental factors. Point pro- cess characteristics are valuable tools for exploratory data analysis in forestry, for describing the variability of forest stands and for under- standing and quantifying ecological relationships. Models of point pro- cesses are also an important basis of modern single-tree modeling, that gives simulation tools for the investigation of forest stru…

Statistics and Probabilitysingle-tree modelsecond order characteristicThinningComputer scienceGeneral MathematicsDiameter at breast heightForestrymodelingvariability indicesField (geography)Point processTree (data structure)Exploratory data analysisEcological relationshipmarkcorrelationStatisticsPoint (geometry)Statistics Probability and UncertaintyecologyGibbs processintensityCox processPoint process
researchProduct

Flexible strategic planning of transport systems

2012

Abstract This paper presents a decision support methodology for long-range planning of transport systems that exhibits strategic flexibility and stochastic system parameters. Unlike one-off strategic decisions, flexible decisions should be dynamically reformulated with time. The proposed methodology is based on the construction of a tree structure of multiple interlinked tactical planning problems, each associated with a scenario in the tree, where problems under scenarios at intermediate dates incorporate in their formulation the solution of the corresponding problems associated with past (future) connected scenarios. The resulting tree structure of interconnected planning decisions become…

Strategic planningFlexibility (engineering)Decision support systemEngineeringOperations researchManagement sciencebusiness.industryGeography Planning and DevelopmentDecision treeTransportationTree (data structure)Tree structureBusiness decision mappingbusinessFleet managementTransportation Planning and Technology
researchProduct

Repetitiveness Measures based on String Attractors and Burrows-Wheeler Transform: Properties and Applications

2023

String AttractorSettore INF/01 - InformaticaMeasure of repetitiveneBurrows-Wheeler TransformCompressed Data StructuresData CompressionCombinatorics on WordStringology
researchProduct

TB-Structure: Collective Intelligence for Exploratory Keyword Search

2017

In this paper we address an exploratory search challenge by presenting a new (structure-driven) collaborative filtering technique. The aim is to increase search effectiveness by predicting implicit seeker’s intents at an early stage of the search process. This is achieved by uncovering behavioral patterns within large datasets of preserved collective search experience. We apply a specific tree-based data structure called a TB (There-and-Back) structure for compact storage of search history in the form of merged query trails – sequences of queries approaching iteratively a seeker’s goal. The organization of TB-structures allows inferring new implicit trails for the prediction of a seeker’s i…

Structure (mathematical logic)Information retrievalComputer science05 social sciencesCollective intelligenceInferenceExploratory search02 engineering and technologyData structureTree (data structure)020204 information systems0202 electrical engineering electronic engineering information engineeringCollaborative filtering0509 other social sciences050904 information & library sciences
researchProduct

Boolean operations with implicit and parametric representation of primitives using R-functions

2005

We present a new and efficient algorithm to accurately polygonize an implicit surface generated by multiple Boolean operations with globally deformed primitives. Our algorithm is special in the sense that it can be applied to objects with both an implicit and a parametric representation, such as superquadrics, supershapes, and Dupin cyclides. The input is a constructive solid geometry tree (CSG tree) that contains the Boolean operations, the parameters of the primitives, and the global deformations. At each node of the CSG tree, the implicit formulations of the subtrees are used to quickly determine the parts to be transmitted to the parent node, while the primitives' parametric definition …

Surface (mathematics)Theoretical computer scienceComputer scienceInformation Storage and Retrieval02 engineering and technologyConstructive solid geometryImaging Three-DimensionalParametric surfaceSuperquadricsImage Interpretation Computer-Assisted[ INFO.INFO-TI ] Computer Science [cs]/Image Processing0202 electrical engineering electronic engineering information engineeringparametric surfaceDifferentiable functionBoolean functionRepresentation (mathematics)ComputingMilieux_MISCELLANEOUSComputingMethodologies_COMPUTERGRAPHICSParametric statisticsGielis curveImplicit functionNumerical analysis020207 software engineeringNumerical Analysis Computer-Assistedsupershape[ INFO.INFO-GR ] Computer Science [cs]/Graphics [cs.GR]Computational geometryImage EnhancementComputer Graphics and Computer-Aided Design[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR]Vertex (geometry)Tree (data structure)Mesh generation[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]Signal ProcessingCurve fitting020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionAlgorithmSoftwareAlgorithms
researchProduct

SimpleBIM: From full ifcOWL graphs to simplified building graphs

2016

International audience; Recent research in semantic web technologies for the built environment has resulted in several proposals to further improve information exchange among stakeholders from the domain. Most notable is the production of several OWL ontologies that allow to capture building data in RDF graphs. For example, an ifcOWL ontology allows to capture IFC data in an RDF graph. As the building data is now available in a semantic graph with an explicit formal basis, it can be restructured and simplified so that it more easily matches the different requirements associated with practical use case scenarios. In this paper, we investigate several proposals and technological approaches to…

Technology and Engineeringbuilding data[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL][INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS]ifcOWL[INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS][ INFO.INFO-CL ] Computer Science [cs]/Computation and Language [cs.CL]IFCBIMlinked data[ INFO.INFO-DS ] Computer Science [cs]/Data Structures and Algorithms [cs.DS][INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]OWL
researchProduct