6533b85cfe1ef96bd12bd320

RESEARCH PRODUCT

A Comparative Study of Nonlinear Machine Learning for the "In Silico" Depiction of Tyrosinase Inhibitory Activity from Molecular Structure.

Maria Del Carmen ChávezMaría M. GarcíaFrancisco TorrensGerardo M. Casañola-martinGladys Casas CardosoConcepción AbadYovani Marrero-ponceHuong Le-thi-thuCarlos Morell

subject

Virtual screeningArtificial neural networkComputer sciencebusiness.industryOrganic ChemistryMachine learningcomputer.software_genreComputer Science ApplicationsSupport vector machineData setStructural BiologyMolecular descriptorTest setDrug DiscoveryMultiple comparisons problemMolecular MedicineArtificial intelligencebusinesscomputerChemical database

description

In the preset report, for the first time, support vector machine (SVM), artificial neural network (ANN), Baye- sian networks (BNs), k-nearest neighbor (k-NN) are applied and compared on two "in-house" datasets to describe the tyrosinase inhibitory activity from the molecular structure. The data set Data I is used for the identification of tyrosi- nase inhibitors (TIs) including 701 active and 728 inactive compounds. Data II consists of active chemicals for potency estimation of TIs. The 2D TOMOCOMD-CARDD atom-based quadratic indices are used as molecular descriptors. The de- rived models show rather encouraging results with the areas under the Receiver Operating Characteristic (AURC) curve in the test set above 0.943 and 0.846 for the Data I and Data II, respectively. Multiple comparison tests are car- ried out to compare the performance of the models and reveal the improvement of machine learning (ML) tech- niques with respect to statistical ones (see Chemometr. Intell. Lab. Syst. 2010, 104, 249). In some cases, these ameli- orations are statistically significant. The tests also demo- strate that k-NN, despite being a rather simple approach, presents the best behavior in both data. The obtained re- sults suggest that the ML-based models could help to im- prove the virtual screening procedures and the confluence of these different techniques can increase the practicality of data mining procedures of chemical databases for the discovery of novel TIs as possible depigmenting agents.

10.1002/minf.201100021https://pubmed.ncbi.nlm.nih.gov/27467154