Comprehensive Strategy for Proton Chemical Shift Prediction: Linear Prediction with Nonlinear Corrections
A fast 3D/4D structure-sensitive procedure was developed and assessed for the chemical shift prediction of protons bonded to sp3carbons, which poses the maybe greatest challenge in the NMR spectral parameter prediction. The LPNC (Linear Prediction with Nonlinear Corrections) approach combines three well-established multivariate methods viz. the principal component regression (PCR), the random forest (RF) algorithm, and the k nearest neighbors (kNN) method. The role of RF is to find nonlinear corrections for the PCR predicted shifts, while kNN is used to take full advantage of similar chemical environments. Two basic molecular models were also compared and discussed: in the MC model the desc…