6533b855fe1ef96bd12aff1d
RESEARCH PRODUCT
On the power and the systematic biases of the detection of chromosomal inversions by paired-end genome sequencing
Mario CáceresMario CáceresJosé Ignacio Lucas-lledósubject
Evolutionary GeneticsChromosome Structure and Functionlcsh:MedicineComputational biologyBiologyGenomeDNA sequencingStructural variation03 medical and health sciences0302 clinical medicineGenetic MutationGeneticsFalse positive paradoxHumansComputer SimulationFalse Positive ReactionsGenomic libraryGenome Sequencinglcsh:ScienceBiologyGenome EvolutionFalse Negative Reactions030304 developmental biologyChromosomal inversionSegmental duplicationGeneticsEvolutionary Biology0303 health sciencesMultidisciplinaryChromosome Biologylcsh:RBreakpointMutation TypesComputational BiologyChromosome MappingGenomic EvolutionGenomicsSequence Analysis DNAComparative GenomicsChromosomes Human Pair 1Chromosome Inversionlcsh:QStructural GenomicsSequence AnalysisAlgorithms030217 neurology & neurosurgeryResearch Articledescription
One of the most used techniques to study structural variation at a genome level is paired-end mapping (PEM). PEM has the advantage of being able to detect balanced events, such as inversions and translocations. However, inversions are still quite difficult to predict reliably, especially from high-throughput sequencing data. We simulated realistic PEM experiments with different combinations of read and library fragment lengths, including sequencing errors and meaningful base-qualities, to quantify and track down the origin of false positives and negatives along sequencing, mapping, and downstream analysis. We show that PEM is very appropriate to detect a wide range of inversions, even with low coverage data. However, ≥% of inversions located between segmental duplications are expected to go undetected by the most common sequencing strategies. In general, longer DNA libraries improve the detectability of inversions far better than increments of the coverage depth or the read length. Finally, we review the performance of three algorithms to detect inversions--SVDetect, GRIAL, and VariationHunter--, identify common pitfalls, and reveal important differences in their breakpoint precisions. These results stress the importance of the sequencing strategy for the detection of structural variants, especially inversions, and offer guidelines for the design of future genome sequencing projects.
year | journal | country | edition | language |
---|---|---|---|---|
2013-04-23 |