6533b82efe1ef96bd12933fe

RESEARCH PRODUCT

Querying and reasoning over large scale building data sets

C. ZhangTarcisio Mendes De FariasJakob BeetzAna RoxinPieter PauwelsJos De RooChristophe Nicolle

subject

Computer scienceData managementBig data[ INFO.INFO-WB ] Computer Science [cs]/Web0211 other engineering and technologiesifcOWL02 engineering and technologySemantic data modelcomputer.software_genreDomain (software engineering)[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]Set (abstract data type)benchmarksemantic webbig data021105 building & construction0202 electrical engineering electronic engineering information engineering[ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]Semantic Web[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]business.industry[INFO.INFO-WB]Computer Science [cs]/WebData set[ INFO.INFO-DB ] Computer Science [cs]/Databases [cs.DB]Building information modelingBenchmark (computing)reasoning020201 artificial intelligence & image processingData miningbusinesscomputer

description

International audience; The architectural design and construction domains work on a daily basis with massive amounts of data. Properly managing, exchanging and exploiting these data is an ever ongoing challenge in this domain. This has resulted in large semantic RDF graphs that are to be combined with a significant number of other data sets (building product catalogues, regulation data, geometric point cloud data, simulation data, sensor data), thus making an already huge dataset even larger. Making these big data available at high performance rates and speeds and into the correct (intuitive) formats is therefore an incredibly high challenge in this domain. Yet, hardly any benchmark is available for this industry that (1) gives an overview of the kind of data typically handled in this domain; and (2) that lists the query and reasoning performance results in handling these data. In this article, we therefore present a set of available sample data that explicates the scale of the situation, and we additionally perform a query and reasoning performance benchmark. This results not only in an initial set of quantitative performance results, but also in recommendations in implementing a web-based system relying heavily on large semantic data. As such, we propose an initial benchmark through which new upcoming data management proposals in the architectural design and construction domains can be measured.

10.1145/2928294.2928303https://doi.org/10.1145/2928294.2928303