6533b7d4fe1ef96bd1263103

RESEARCH PRODUCT

Automating statistical diagrammatic representations with data characterization

Pedro M. Valero-moraPere Millán-martínez

subject

business.industryComputer science020207 software engineeringCognition02 engineering and technologyGraphic designcomputer.software_genre01 natural sciencesCharacterization (materials science)010104 statistics & probabilityInformation visualizationDiagrammatic reasoningOpen dataHuman–computer interaction0202 electrical engineering electronic engineering information engineeringComputer Vision and Pattern RecognitionArtificial intelligence0101 mathematicsbusinesscomputerStatistical graphicsNatural language processingGraphical user interface

description

The search for an efficient method to enhance data cognition is especially important when managing data from multidimensional databases. Open data policies have dramatically increased not only the volume of data available to the public, but also the need to automate the translation of data into efficient graphical representations. Graphic automation involves producing an algorithm that necessarily contains inputs derived from the type of data. A set of rules are then applied to combine the input variables and produce a graphical representation. Automated systems, however, fail to provide an efficient graphical representation because they only consider either a one-dimensional characterization of variables, which leads to an overwhelmingly large number of available solutions, a compositional algebra that leads to a single solution, or requires the user to predetermine the graphical representation. Therefore, we propose a multidimensional characterization of statistical variables that when complemented with a catalog of graphical representations that match any single combination, presents the user with a more specific set of suitable graphical representations to choose from. Cognitive studies can then determine the most efficient perceptual procedures to further shorten the path to the most efficient graphical representations. The examples used herein are limited to graphical representations with three variables given that the number of combinations increases drastically as the number of selected variables increases.

https://doi.org/10.1177/1473871617715326