6533b7d9fe1ef96bd126d4d7

RESEARCH PRODUCT

Retrained Classification of Tyrosinase Inhibitors and “In Silico” Potency Estimation by Using Atom-Type Linear Indices

Gerardo M. Casañola-martinMahmud Tareq Hassan KhanFrancisco TorrensYovani Marrero-ponceRamón García-domenechAntonio RescignoHuong Le-thi-thu

subject

Quantitative structure–activity relationshipEngineeringSpeedupbusiness.industryIn silicoAtom (order theory)Pattern recognitionLinear discriminant analysiscomputer.software_genreSet (abstract data type)Artificial intelligenceData miningbusinesscomputerSelection (genetic algorithm)Applicability domain

description

In this paper, the authors present an effort to increase the applicability domain (AD) by means of retraining models using a database of 701 great dissimilar molecules presenting anti-tyrosinase activity and 728 drugs with other uses. Atom-based linear indices and best subset linear discriminant analysis (LDA) were used to develop individual classification models. Eighteen individual classification-based QSAR models for the tyrosinase inhibitory activity were obtained with global accuracy varying from 88.15-91.60% in the training set and values of Matthews correlation coefficients (C) varying from 0.76-0.82. The external validation set shows globally classifications above 85.99% and 0.72 for C. All individual models were validated and fulfilled by OECD principles. A brief analysis of AD for the training set of 478 compounds and the new active compounds included in the re-training was carried out. Various assembled multiclassifier systems contained eighteen models using different selection criterions were obtained, which provide possibility of select the best strategy for particular problem. The various assembled multiclassifier systems also estimated the potency of active identified compounds. Eighteen validated potency models by OECD principles were used.

https://doi.org/10.4018/ijcce.2012070104