6533b86efe1ef96bd12cac5f
RESEARCH PRODUCT
CheS-Mapper 2.0 for visual validation of (Q)SAR models
Martin GütleinStefan KramerAndreas Karwathsubject
Visualization methodsComputer scienceFeature vectorLibrary and Information Sciencescomputer.software_genre01 natural sciences(Q)SARModel validation03 medical and health sciencesSoftwareValidationOverall performancePhysical and Theoretical ChemistryVisualization030304 developmental biology0303 health sciencesbusiness.industryStatistical validationComputer Graphics and Computer-Aided Design0104 chemical sciencesComputer Science ApplicationsVisualization010404 medicinal & biomolecular chemistry3d space3D spaceData miningbusinesscomputerSoftwaredescription
Abstract Background Sound statistical validation is important to evaluate and compare the overall performance of (Q)SAR models. However, classical validation does not support the user in better understanding the properties of the model or the underlying data. Even though, a number of visualization tools for analyzing (Q)SAR information in small molecule datasets exist, integrated visualization methods that allow the investigation of model validation results are still lacking. Results We propose visual validation, as an approach for the graphical inspection of (Q)SAR model validation results. The approach applies the 3D viewer CheS-Mapper, an open-source application for the exploration of small molecules in virtual 3D space. The present work describes the new functionalities in CheS-Mapper 2.0, that facilitate the analysis of (Q)SAR information and allows the visual validation of (Q)SAR models. The tool enables the comparison of model predictions to the actual activity in feature space. The approach is generic: It is model-independent and can handle physico-chemical and structural input features as well as quantitative and qualitative endpoints. Conclusions Visual validation with CheS-Mapper enables analyzing (Q)SAR information in the data and indicates how this information is employed by the (Q)SAR model. It reveals, if the endpoint is modeled too specific or too generic and highlights common properties of misclassified compounds. Moreover, the researcher can use CheS-Mapper to inspect how the (Q)SAR model predicts activity cliffs. The CheS-Mapper software is freely available at http://ches-mapper.org. Graphical abstract Comparing actual and predicted activity values with CheS-Mapper.
year | journal | country | edition | language |
---|---|---|---|---|
2014-09-01 | Journal of Cheminformatics |