Search results for " Regression"
showing 10 items of 1835 documents
Bond-extended stochastic and nonstochastic bilinear indices. I. QSPR/QSAR applications to the description of properties/activities of small-medium si…
2010
Bond-extended stochastic and nonstochastic bilinear indices are introduced in this article as novel bond-level molecular descriptors (MDs). These novel totals (whole-molecule) MDs are based on bilinear maps (forms) similar to use defined in linear algebra. The proposed nonstochastic indices try to match molecular structure provided by the molecular topology by using the kth Edge(Bond)-Adjacency Matrix (Ek, designed here as a nonstochastic E matrix). The stochastic parameters are computed by using the kth stochastic edge-adjacency matrix, ESk, as matrix operators of bilinear transformations. This new edge (bond)-adjacency relationship can be obtained directly from Ek and can be considered li…
Application of the modelling power approach to variable subset selection for GA-PLS QSAR models
2007
A previously developed function, the Modelling Power Plot, has been applied to QSARs developed using partial least squares (PLS) following variable selection from a genetic algorithm (GA). Modelling power (Mp) integrates the predictive and descriptive capabilities of a QSAR. With regard to QSARs for narcotic toxic potency, Mp was able to guide the optimal selection of variables using a GA. The results emphasise the importance of Mp to assess the success of the variable selection and that techniques such as PLS are more robust following variable selection.
Calculation of chromatographic properties of barbiturates by molecular topology
1995
A study has been made of the relationship between the RF values obtained by thin layer chromatography for a group of barbiturates and the connectivity indices proposed by Kier and Hall. By using multivariable regression we obtained the corresponding connectivity functions, which were selected on the basis of their respective statistics parameters. The regression analysis of the connectivity functions shows a correct prediction of the experimental elution sequence for this group of molecules on silicagel with two mobile phases of different polarity. The corresponding random and stability studies of the different prediction models selected were carried out, demonstrating good stability and nu…
Atom-based 3D-chiral quadratic indices. Part 2: prediction of the corticosteroid-binding globulinbinding affinity of the 31 benchmark steroids data s…
2005
A quantitative structure-activity relationship (QSAR) study to predict the relative affinities of the steroid 'benchmark' data set to the corticosteroid-binding globulin (CBG) is described. It is shown that the 3D-chiral quadratic indices closely correlate with the measured CBG affinity values for the 31 steroids. The calculated descriptors were correlated with biological data through multiple linear regressions. Two statistically significant models were obtained when non-stochastic (R = 0.924 and s = 0.46) as well as stochastic (R = 0.929 and s = 0.46) 3D-chiral quadratic indices were used. A leave-one-out (LOO) approach to model validation is used here; the best results obtained in the cr…
Protein linear indices of the ‘macromolecular pseudograph α-carbon atom adjacency matrix’ in bioinformatics. Part 1: Prediction of protein stability …
2005
Abstract A novel approach to bio-macromolecular design from a linear algebra point of view is introduced. A protein’s total (whole protein) and local (one or more amino acid) linear indices are a new set of bio-macromolecular descriptors of relevance to protein QSAR/QSPR studies. These amino-acid level biochemical descriptors are based on the calculation of linear maps on R n [ f k ( x m i ) : R n → R n ] in canonical basis. These bio-macromolecular indices are calculated from the kth power of the macromolecular pseudograph α-carbon atom adjacency matrix. Total linear indices are linear functional on R n . That is, the kth total linear indices are linear maps from R n to the scalar R [ f k …
Prediction of acute toxicity of phenol derivatives using multiple linear regression approach for Tetrahymena pyriformis contaminant identification in…
2016
In this article, the modeling of inhibitory grown activity against Tetrahymena pyriformis is described. The 0-2D Dragon descriptors based on structural aspects to gain some knowledge of factors influencing aquatic toxicity are mainly used. Besides, it is done by some enlarged data of phenol derivatives described for the first time and composed of 358 chemicals. It overcomes the previous datasets with about one hundred compounds. Moreover, the results of the model evaluation by the parameters in the training, prediction and validation give adequate results comparable with those of the previous works. The more influential descriptors included in the model are: X3A, MWC02, MWC10 and piPC03 wit…
A novel approach to predict aquatic toxicity from molecular structure
2008
The main aim of the study was to develop quantitative structure-activity relationship (QSAR) models for the prediction of aquatic toxicity using atom-based non-stochastic and stochastic linear indices. The used dataset consist of 392 benzene derivatives, separated into training and test sets, for which toxicity data to the ciliate Tetrahymena pyriformis were available. Using multiple linear regression, two statistically significant QSAR models were obtained with non-stochastic (R2=0.791 and s=0.344) and stochastic (R2=0.799 and s=0.343) linear indices. A leave-one-out (LOO) cross-validation procedure was carried out achieving values of q2=0.781 (scv=0.348) and q2=0.786 (scv=0.350), respecti…
Prediction of ionic liquid's heat capacity by means of their in silico principal properties
2016
The in silico principal properties (PPs) of ionic liquids (ILs), derived by means of the VolSurf+ approach, were used to develop a Partial Least Squares (PLS) model able to find a quantitative correlation among IL descriptors (accounting for both cationic and anionic structural features) and heat capacity values, providing affordable predictions validated by experimental Cp measurements for an external set of ILs. In silico predictions allowed the selection of a limited number of structurally different ILs with similar Cp values, providing the possibility to select an optimal IL according to efficiency, as well as to environmental and economic sustainability. The present general procedure, …
tomocomd-camps and protein bilinear indices - novel bio-macromolecular descriptors for protein research: I. Predicting protein stability effects of a…
2010
Descriptors calculated from a specific representation scheme encode only one part of the chemical information. For this reason, there is a need to construct novel graphical representations of proteins and novel protein descriptors that can provide new information about the structure of proteins. Here, a new set of protein descriptors based on computation of bilinear maps is presented. This novel approach to biomacromolecular design is relevant for QSPR studies on proteins. Protein bilinear indices are calculated from the kth power of nonstochastic and stochastic graph–theoretic electronic-contact matrices, and , respectively. That is to say, the kth nonstochastic and stochastic protein bili…
Atom-Based Quadratic Indices to Predict Aquatic Toxicity of Benzene Derivatives to <i>Tetrahymena pyriformis</i>
2009
The non-stochastic and stochastic atom-based quadratic indices are applied to develop quantitative structure-activity relationship (QSAR) models for the prediction of aquatic toxicity. The used dataset, consisting of 392 benzene derivatives for which toxicity data to the ciliate Tetrahymena pyriformis were available, is divided into training and test sets. The obtained multiple linear regression models are statistically significant (R2 = 0.787 and s = 0.347, R2 = 0.806 and s = 0.329, for non-stochastic and stochastic quadratic indices, respectively) and show rather good stability in a cross-validation experiment (q2 = 0.769 and scv = 0.357, q2 = 0.791 and scv = 0.337, correspondingly). In a…