RESEARCH PRODUCT
Superposing significant interaction rules (SSIR) method: a simple procedure for rapid ranking of congeneric compounds
Emili Besalú, Lionello Pogliani, Jesús Vicente De Julián-Ortiz
subject
Simple (abstract algebra); Computer science; business.industry; Pattern recognition; Combinatorial chemistry; SSIR method; Congener series; Ranking; SAR; Balanced Leave-two-out cross validation (BL2O); General Chemistry; Artificial intelligence; Chemistry; business; Ranking (information retrieval)
description
The Superposing Significant Interaction Rules (SSIR) method is revised and implemented. The method is a simple combinatorial procedure that generates rules in situ for a dichotomized congeneric molecular family and selects the most probabilistically relevant ones. Simply counting the number of relevant rules attached to each new compound yields a molecular ranking useful for database filtering, refinement and prediction. The algorithm needs only a symbolic molecular representation, which allows the database to be mined confidentially: third parties need not know the actual compounds being studied. The procedure is tested on a complete series of substituted amino acids. Areas under the receiver operating characteristic curve (AU-ROC) are greater than 0.9 for all the protocols tried: training, leave-one-out, balanced leave-two-out and 5-fold cross-validation and, finally, a stochastic series of calculations combined with a randomization test.
The author acknowledges the Generalitat de Catalunya (Departament d'Innovació, Universitats i Empresa) for the financial support given to the QTMEM (Química teòrica i Modelatge i Enginyeria Molecular) research group of the University of Girona (code 2014-SGR-1202).
year | journal | country | edition | language |
---|---|---|---|---|
2020-02-15 | | | | |