6533b853fe1ef96bd12ace22

RESEARCH PRODUCT

Different mechanisms generating sequence variability are revealed in distinct regions of the hydroxyproline-rich glycoprotein gene from maize and related species.

Pere PuigdomènechAndrés MoyaRegina RazJosé MJosé Antonio Martínez-izquierdo

subject

Transposable elementGeneticsBase SequenceTATA boxMolecular Sequence DataNucleic acid sequenceIntronGenetic VariationBiologybiology.organism_classificationZea maysZea diploperennisHydroxyprolineMolecular evolutionSequence Homology Nucleic AcidGeneticsCoding regionAmino Acid SequenceMolecular BiologyGeneSequence AlignmentPhylogenyGlycoproteinsPlant Proteins

description

The sequences of the genes coding for a hydroxyproline-rich glycoprotein from two varieties of maize (Zea mays, Ac1503 and W22), a teosinte (Zea diploperennis) and sorghum (Sorghum vulgare) have been obtained and compared. Distinct patterns of variability have been observed along their sequences. The 500 bp region immediately upstream of the TATA box is highly conserved in the Zea species and contains stretches of sequences also found in the sorghum gene. Further upstream, significant rearrangements are observed, even between the two maize varieties. These observations allow definition of a 5' region, which is common to the four genes and is probably essential for their expression. The 3' end shows variability, mostly due to small duplications and single nucleotide substitutions. There is an intron present in this region showing a high degree of sequence conservation among the four genes analyzed. The coding region is the most divergent, but variability arises from duplications of fragments coding for similar protein blocks and from single nucleotide substitutions. These results indicate that a number of distinct mechanisms (probably point mutation, transposon insertion and excision, homologous recombination and unequal crossing-over) are active in the production of sequence variability in maize and related species. They are revealed in different parts of the gene, probably as the result of the different types of functional constraints acting on them, and of the specific nature of the sequence in each region.

10.1007/bf00587586https://pubmed.ncbi.nlm.nih.gov/1603067