
RESEARCH PRODUCT

<strong>New tool useful for drug discovery validated through benchmark datasets</strong>

Yaile Caballero, Stephen J. Barigye, Yovani Marrero-Ponce, Francisco Torrens, Julio Madera, Efrain Chaluisa Quishpe, Reisel Millán Cabrera, Yoan Martínez-López, Juan A. Castillo-Garit

subject

Set (abstract data type), Quantitative structure–activity relationship, Orthogonality, Computer science, Molecular descriptor, Principal component analysis, Genetic algorithm, Benchmark (computing), Data mining, Information theory

description

Atomic Weighted Vectors (AWVs) are vectors that encode the information of molecular structures; a set of Aggregation Operators (AOs) can be applied to them to calculate total and local molecular descriptors (MDs). This article presents an exploratory study of a new tool useful for drug discovery, evaluated on different datasets, such as the DRAGON and Sutherland datasets, and compared with other well-known approaches. To evaluate the performance of the tool, several statistical analyses and QSAR/QSPR experiments were performed. Variability analysis, by means of an information theory-based algorithm, is used to quantify the information content of the AWVs obtained from the tool. Principal Component Analysis (PCA) is used to analyze the orthogonality of these descriptors; it shows that the new MDs from AWVs provide information different from that codified by DRAGON descriptors (0-2D). QSAR models are obtained for each of Sutherland’s datasets, keeping the original division into training/test sets, by means of Multiple Linear Regression with a Genetic Algorithm (MLR-GA). These models have been validated and compare favorably with several previously published approaches that use the same benchmark datasets. The results suggest that this tool, despite its simplicity, is a useful strategy for QSAR/QSPR studies.
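As a rough illustration of two ideas named in the description, the sketch below applies a few simple aggregation operators to one atomic weighted vector and then estimates the information content of a descriptor with a binned Shannon entropy. It is only a minimal sketch under stated assumptions: the function names `aggregate` and `shannon_entropy`, the choice of operators, the equal-width binning, and the example values are all hypothetical and are not taken from the tool described here.

```python
import math
from collections import Counter
from statistics import mean


def aggregate(awv):
    """Apply a few simple aggregation operators to one atomic weighted vector.

    The operators below are illustrative assumptions; the abstract does not
    list which aggregation operators the tool actually provides.
    """
    return {
        "sum": sum(awv),
        "mean": mean(awv),
        "max": max(awv),
        "min": min(awv),
        "l2_norm": math.sqrt(sum(x * x for x in awv)),
    }


def shannon_entropy(values, bins=4):
    """Shannon entropy (in bits) of descriptor values after equal-width binning.

    Higher entropy means the descriptor takes more evenly spread values
    across the dataset. The binning scheme is an assumption of this sketch.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return 0.0  # a constant descriptor carries no information
    width = (hi - lo) / bins
    # The maximum value is clamped into the last bin.
    counts = Counter(min(int((v - lo) / width), bins - 1) for v in values)
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Hypothetical per-atom weights for three small molecules (made-up values).
molecules = [[2.55, 2.55, 3.44], [2.55, 3.04], [2.55, 2.55, 2.55, 3.98]]
total_descriptors = [aggregate(awv)["sum"] for awv in molecules]
entropy_of_sum = shannon_entropy(total_descriptors, bins=2)
```

Each molecule yields one value per operator, so a dataset yields one column per operator; the entropy of a column is one way to compare how much the resulting descriptors vary.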

https://doi.org/10.3390/mol2net-04-05132