RESEARCH PRODUCT

Open Data Quality Evaluation: A Comparative Analysis of Open Data in Latvia

Anastasija Nikiforova

subject

FOS: Computer and information sciences; General Computer Science; Computer science; media_common.quotation_subject; Stakeholder; Latvian; Databases (cs.DB); Statistics - Applications; Statistics - Computation; language.human_language; Computer Science - Information Retrieval; Computer Science - Computers and Society; Open data; Lead (geology); Computer Science - Databases; Risk analysis (engineering); Data quality; Computers and Society (cs.CY); language; Mainstream; Quality (business); Applications (stat.AP); Information Retrieval (cs.IR); Computation (stat.CO); media_common

description

Open data is entering the mainstream: it is freely available to every stakeholder and is often used in business decision-making. It is important to be sure that the data is trustworthy and error-free, as quality problems can lead to huge losses. The research discusses how (open) data quality can be assessed. It also covers the main points that should be considered when developing a data quality management solution. One specific approach is applied to several Latvian open data sets. The research provides a step-by-step guide to analysing open data sets and summarizes the results. It also shows that data quality can differ depending on the data supplier (centralized versus decentralized data releases) and that, unfortunately, even a trustworthy data supplier cannot guarantee the absence of data quality problems. Finally, it highlights common data quality problems detected not only in Latvian open data but also in the open data of three European countries.

https://dx.doi.org/10.48550/arxiv.2007.04697