Search results for "chromosome"
showing 10 items of 1175 documents
The Molecular Basis of X-Linked Spondyloepiphyseal Dysplasia Tarda
2001
The X-linked form of spondyloepiphyseal dysplasia tarda (SEDL), a radiologically distinct skeletal dysplasia affecting the vertebrae and epiphyses, is caused by mutations in the SEDL gene. To characterize the molecular basis for SEDL, we have identified the spectrum of SEDL mutations in 30 of 36 unrelated cases of X-linked SEDL ascertained from different ethnic populations. Twenty-one different disease-associated mutations now have been identified throughout the SEDL gene. These include nonsense mutations in exons 4 and 5, missense mutations in exons 4 and 6, small (2–7 bp) and large (>1 kb) deletions, insertions, and putative splicing errors, with one splicing error due to a complex deleti…
Musket: a multistage k-mer spectrum-based error corrector for Illumina sequence data
2012
Abstract Motivation: The imperfect sequence data produced by next-generation sequencing technologies have motivated the development of a number of short-read error correctors in recent years. The majority of methods focus on the correction of substitution errors, which are the dominant error source in data produced by Illumina sequencing technology. Existing tools either score high in terms of recall or precision but not consistently high in terms of both measures. Results: In this article, we present Musket, an efficient multistage k-mer-based corrector for Illumina short-read data. We use the k-mer spectrum approach and introduce three correction techniques in a multistage workflow: two-s…
SeqEditor: an application for primer design and sequence analysis with or without GTF/GFF files
2021
[Motivation]: Sequence analyses oriented to investigate specific features, patterns and functions of protein and DNA/RNA sequences usually require tools based on graphic interfaces whose main characteristic is their intuitiveness and interactivity with the user’s expertise, especially when curation or primer design tasks are required. However, interface-based tools usually pose certain computational limitations when managing large sequences or complex datasets, such as genome and transcriptome assemblies. Having these requirments in mind we have developed SeqEditor an interactive software tool for nucleotide and protein sequences’ analysis.
A non-linear optimization procedure to estimate distances and instantaneous substitution rate matrices under the GTR model.
2006
Abstract Motivation: The general-time-reversible (GTR) model is one of the most popular models of nucleotide substitution because it constitutes a good trade-off between mathematical tractability and biological reality. However, when it is applied for inferring evolutionary distances and/or instantaneous rate matrices, the GTR model seems more prone to inapplicability than more restrictive time-reversible models. Although it has been previously noted that the causes for intractability are caused by the impossibility of computing the logarithm of a matrix characterised by negative eigenvalues, the issue has not been investigated further. Results: Here, we formally characterize the mathematic…
Long read alignment based on maximal exact match seeds
2012
Abstract Motivation: The explosive growth of next-generation sequencing datasets poses a challenge to the mapping of reads to reference genomes in terms of alignment quality and execution speed. With the continuing progress of high-throughput sequencing technologies, read length is constantly increasing and many existing aligners are becoming inefficient as generated reads grow larger. Results: We present CUSHAW2, a parallelized, accurate, and memory-efficient long read aligner. Our aligner is based on the seed-and-extend approach and uses maximal exact matches as seeds to find gapped alignments. We have evaluated and compared CUSHAW2 to the three other long read aligners BWA-SW, Bowtie2 an…
A cladistic approach to testing phylogenomic evolution in Strepsirhines
2010
One is not enough: On the effects of reference genome for the mapping and subsequent analyses of short-reads.
2020
Mapping of high-throughput sequencing (HTS) reads to a single arbitrary reference genome is a frequently used approach in microbial genomics. However, the choice of a reference may represent a source of errors that may affect subsequent analyses such as the detection of single nucleotide polymorphisms (SNPs) and phylogenetic inference. In this work, we evaluated the effect of reference choice on short-read sequence data from five clinically and epidemiologically relevant bacteria (Klebsiella pneumoniae, Legionella pneumophila, Neisseria gonorrhoeae, Pseudomonas aeruginosa and Serratia marcescens). Publicly available whole-genome assemblies encompassing the genomic diversity of these species…
In silico characterization of an Iroquois family-related homeodomain protein.
2005
Homeobox genes have been demonstrated to play important roles during cancer differentiation and embryonic development. The subset of Iroquois-related homeobox genes (IRXs) have furthermore been. demonstrated to be involved in several embryonic developmental processes such as patterning of the anterior-posterior and dorso-ventral axis, as well as specific regions of the central nervous system, and differentiation of the otic vesicle, branchial epithelium, and limbs. We have characterized a novel homeodomain protein and corresponding gene by means of computational biology. Since the protein sequence displayed high similarity to the human IRX proteins, the newly identified homeodomain protein …
Focal DNA Copy Number Changes in Neuroblastoma Target MYCN Regulated Genes
2013
Neuroblastoma is an embryonic tumor arising from immature sympathetic nervous system cells. Recurrent genomic alterations include MYCN and ALK amplification as well as recurrent patterns of gains and losses of whole or large partial chromosome segments. A recent whole genome sequencing effort yielded no frequently recurring mutations in genes other than those affecting ALK. However, the study further stresses the importance of DNA copy number alterations in this disease, in particular for genes implicated in neuritogenesis. Here we provide additional evidence for the importance of focal DNA copy number gains and losses, which are predominantly observed in MYCN amplified tumors. A focal 5 kb…
snoRNPs Regulate Telomerase Activity in Neuroblastoma and Are Associated with Poor Prognosis
2013
AbstractAmplification of the MYCN oncogene is strongly associated with poor prognosis in neuroblastoma (NB). In addition to MYCN amplification, many studies have focused on identifying patients with a poor prognosis based on gene expression profiling. The majority of prognostic signatures today are comprised of large gene lists limiting their clinical application. In addition, although of prognostic significance,most of these signatures fail to identify cellular processes that can explain their relation to prognosis. Here, we determined prognostically predictive genes in a data set containing 251 NBs. Gene Ontology analysis was performed on significant genes with a positive hazard ratio to …