
RESEARCH PRODUCT

A Sliding Window-Based Method to Detect Selective Constraints in Protein-Coding Genes and Its Application to RNA Viruses

Andrés Moya, Javier Ortiz, Mario A. Fares, Santiago F. Elena, Eladio Barrio

subject

Nonsynonymous substitution; Genes Viral; Sequence alignment; Biology; Genes env; Evolution Molecular; Viral Proteins; Sliding window protocol; Genetics; RNA Viruses; Selection Genetic; Molecular Biology; Gene; Phylogeny; Ecology Evolution Behavior and Systematics; Selection (genetic algorithm); Base Sequence; Reproducibility of Results; Contrast (statistics); RNA; Window (computing); Genes gag; Foot-and-Mouth Disease Virus; DNA Viral; HIV-1; Capsid Proteins; Algorithm

description

Here we present a new sliding window-based method designed to detect selective constraints in specific regions of a multiple alignment of protein-coding sequences. In contrast to previous window-based procedures, our method uses a nonarbitrary statistical approach to find the appropriate codon-window size for testing deviations of synonymous (d(S)) and nonsynonymous (d(N)) nucleotide substitutions from the expectation. The probabilities of d(N) and d(S) are obtained from simulated data and used to detect significant deviations of d(N) and d(S) in a specific window region of the real sequence alignment. The nonsynonymous-to-synonymous rate ratio (w = d(N)/d(S)) was used to highlight selective constraints in any window in which d(S) or d(N) was significantly different from the expectation. In these significant windows, w and its variance [V(w)] were calculated and used to test the neutral hypothesis. Computer simulations showed that the method is accurate even for highly divergent sequences. The main advantages of the new method are that it (i) uses a statistically appropriate window size to detect different selective patterns, (ii) is computationally less intensive than maximum likelihood methods, and (iii) detects saturation of synonymous sites, which can give rise to deviations from neutrality. Hence, it allows the analysis of highly divergent sequences as well as the testing of different alternative hypotheses. The application of the method to different human immunodeficiency virus type 1 genes and to foot-and-mouth disease virus genes confirms the action of positive selection on previously described regions as well as on new regions.
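
The general shape of such a window scan can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`window_means`, `empirical_p`, `scan`), the use of per-codon d(N) and d(S) estimates as input, the two-sided empirical p-value, and the fixed `size` and `alpha` parameters are all assumptions made for illustration. The statistical choice of window size and the variance V(w) described above are not reproduced here.

```python
from typing import List, Sequence, Tuple


def window_means(values: Sequence[float], size: int) -> List[float]:
    """Mean of `values` over every run of `size` consecutive codons (step 1)."""
    if size < 1 or size > len(values):
        raise ValueError("window size must be between 1 and len(values)")
    return [sum(values[i:i + size]) / size
            for i in range(len(values) - size + 1)]


def empirical_p(observed: float, simulated: Sequence[float]) -> float:
    """Two-sided empirical p-value of `observed` against simulated values."""
    lower = sum(1 for s in simulated if s <= observed)
    upper = sum(1 for s in simulated if s >= observed)
    return min(1.0, 2.0 * (min(lower, upper) + 1) / (len(simulated) + 1))


def scan(dn: Sequence[float], ds: Sequence[float],
         sim_dn: Sequence[Sequence[float]], sim_ds: Sequence[Sequence[float]],
         size: int, alpha: float = 0.05) -> List[Tuple[int, float, float, float]]:
    """Return (start codon, w, p for d(N), p for d(S)) for significant windows.

    `sim_dn` and `sim_ds` hold per-codon values from simulated alignments
    (hypothetical inputs); their window means form the null distributions.
    """
    null_dn = [m for rep in sim_dn for m in window_means(rep, size)]
    null_ds = [m for rep in sim_ds for m in window_means(rep, size)]
    hits = []
    pairs = zip(window_means(dn, size), window_means(ds, size))
    for start, (n, s) in enumerate(pairs):
        p_n = empirical_p(n, null_dn)
        p_s = empirical_p(s, null_ds)
        # Only windows where d(N) or d(S) deviates from expectation get a ratio.
        if min(p_n, p_s) < alpha:
            w = n / s if s > 0 else float("inf")
            hits.append((start, w, p_n, p_s))
    return hits


# Toy data: codons 10-14 carry an excess of nonsynonymous change.
dn = [0.25] * 10 + [2.0] * 5 + [0.25] * 10
ds = [0.5] * 25
sim_dn = [[0.25] * 25 for _ in range(10)]
sim_ds = [[0.5] * 25 for _ in range(10)]
hits = scan(dn, ds, sim_dn, sim_ds, size=5)
```

In this toy example every window overlapping codons 10-14 is flagged, and the window that covers exactly those codons has w well above 1. Real input would come from pairwise or tree-based d(N) and d(S) estimates and from alignments simulated under a neutral model.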

https://doi.org/10.1007/s00239-002-2346-9