RESEARCH PRODUCT

JANE: efficient mapping of prokaryotic ESTs and variable length sequence reads on related template genomes

Chunguang Liang, Roy Gross, María José López-Sánchez, Thomas Dandekar, Andrés Moya, Alexander Schmid, Jörg Bernhardt

subject

Computational biology; Biology; lcsh:Computer applications to medicine. Medical informatics; Biochemistry; Genome; User-Computer Interface; Structural Biology; Databases, Genetic; lcsh:QH301-705.5; Molecular Biology; Sequence (medicine); Expressed Sequence Tags; Whole genome sequencing; Genetics; Internet; Expressed sequence tag; Base Sequence; Phylum; Applied Mathematics; Nucleic acid sequence; Sequence Analysis, DNA; Computer Science Applications; lcsh:Biology (General); Single cell sequencing; lcsh:R858-859.7; DNA microarray; Software

description

Abstract

Background: Expressed sequence tags (ESTs) or variable-length sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies and (ii) single-cell sequencing of bacteria. Without suitable software, their further analysis and mapping would have to await finalization of the corresponding genome.

Results: The tool JANE rapidly maps ESTs or variable-length sequence reads from prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphical interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, for rapid mapping we developed an enhanced sequence alignment algorithm which reassembles and evaluates high-scoring pairs provided by the BLAST algorithm. Sequence reads or mapped ESTs are rapidly assembled on the template genome and replace the template sequence. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, and (ii) by mapping single-cell sequencing reads from Poribacteria to Rhodopirellula baltica SH1, a representative of the sister phylum. The algorithm has been implemented in a web server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de.

Conclusion: Applying JANE achieves rapid mapping of prokaryotic ESTs or sequence reads even without knowing the cognate genome sequence.
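The abstract does not spell out how high-scoring pairs from BLAST are reassembled and evaluated, so the following is only a minimal illustrative sketch of one common approach, collinear chaining, and should not be read as JANE's actual algorithm. The `HSP` record, the `best_chain` function, and the example coordinates are all hypothetical. The sketch also assumes forward-strand hits only and requires chained pairs not to overlap on either the read or the template.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class HSP:
    """One high-scoring pair (hypothetical record, not JANE's data model)."""
    q_start: int   # start on the read/EST (query), inclusive
    q_end: int     # end on the query, inclusive
    s_start: int   # start on the template genome (subject), inclusive
    s_end: int     # end on the template genome, inclusive
    score: float   # e.g. a BLAST bit score


def best_chain(hsps: List[HSP]) -> List[HSP]:
    """Return the highest-scoring chain of collinear, non-overlapping HSPs.

    Assumption: all HSPs lie on the forward strand (start <= end).
    Simple O(n^2) dynamic programme over HSPs sorted by query start.
    """
    if not hsps:
        return []
    ordered = sorted(hsps, key=lambda h: (h.q_start, h.s_start))
    best = [h.score for h in ordered]   # best chain score ending at HSP i
    prev = [-1] * len(ordered)          # back-pointer to the previous HSP
    for i, cur in enumerate(ordered):
        for j in range(i):
            cand = ordered[j]
            # cand may precede cur only if it ends before cur starts
            # on both the query and the template.
            if cand.q_end < cur.q_start and cand.s_end < cur.s_start:
                if best[j] + cur.score > best[i]:
                    best[i] = best[j] + cur.score
                    prev[i] = j
    # Trace back from the best-scoring chain end.
    i = max(range(len(ordered)), key=best.__getitem__)
    chain = []
    while i != -1:
        chain.append(ordered[i])
        i = prev[i]
    return chain[::-1]


# Made-up example: three collinear hits plus one stray hit elsewhere.
a = HSP(1, 50, 1001, 1050, 90.0)
b = HSP(40, 80, 5000, 5040, 60.0)      # overlaps a on the read, far away on the template
c = HSP(60, 120, 1060, 1120, 110.0)
d = HSP(130, 160, 1130, 1160, 40.0)
chain = best_chain([b, d, a, c])
total = sum(h.score for h in chain)
```

In this made-up example the stray hit `b` is left out and the chain `a`, `c`, `d` is kept. A real mapper would additionally have to handle reverse-strand hits, partial overlaps, and gap penalties.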

https://doi.org/10.1186/1471-2105-10-391