Search results for "RefSeq"
showing 7 items of 7 documents
GET_PHYLOMARKERS, a software package to select optimal orthologous clusters for phylogenomics and inferring pan-genome phylogenies, used for a critic…
2018
22 Pags.- 3 Tabls.- 7 Figs. Creative Commons License Attribution 4.0 International (CC BY 4.0).
MetaCache: context-aware classification of metagenomic reads using minhashing.
2017
Abstract Motivation Metagenomic shotgun sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes corresponding software tools suffer from either long runtimes, large memory requirements or low accuracy. Results We introduce MetaCache—a novel software for read classification using the big data technique minhashing. Our…
The gypsy database (GyDB) of mobile genetic elements: release 2.0
2011
This article introduces the second release of the Gypsy Database of Mobile Genetic Elements (GyDB 2.0): a research project devoted to the evolutionary dynamics of viruses and transposable elements based on their phylogenetic classification (per lineage and protein domain). The Gypsy Database (GyDB) is a long-term project that is continuously progressing, and that owing to the high molecular diversity of mobile elements requires to be completed in several stages. GyDB 2.0 has been powered with a wiki to allow other researchers participate in the project. The current database stage and scope are long terminal repeats (LTR) retroelements and relatives. GyDB 2.0 is an update based on the analys…
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut
2013
Abstract Background The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. Results We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different ass…
Classic bladder exstrophy: Frequent 22q11.21 duplications and definition of a 414 kb phenocritical region
2014
Background: Classic bladder exstrophy (CBE) is the most common form of the bladder exstrophy and epispadias complex. Previously, we and others have identified four patients with a duplication of 22q11.21 among a total of 96 unrelated CBE patients. Methods: Here, we investigated whether this chromosomal aberration was commonly associated with CBE/bladder exstrophy and epispadias complex in an extended case-control sample. Multiplex ligation-dependent probe amplification and microarray-based analysis were used to identify 22q11.21 duplications in 244 unrelated bladder exstrophy and epispadias complex patients (including 217 CBE patients) and 665 healthy controls. Results: New duplications of …
In silico strategy for detection of target candidates for antibody therapy of solid tumors
2008
In contrast to earlier attempts for the identification of target candidates suitable for monoclonal antibody (mAb) based cancer therapies we concentrated on highly selective lineage-specific genes additionally preserved or even overexpressed in orthotopic cancers. In a script aided workflow we reduced all human entries of the RefSeq mRNA database to those encoding transmembrane domain bearing gene products and subjected them to BLAST analysis against the human EST database. All BLAST results were validated in a gene centric way allowing two types of data curation prior to expression profiling of matching ESTs in selected healthy tissues: (i) exclusion of questionable ESTs arising e.g. from …
Identification of new claudin family members by a novel PSI-BLAST based approach with enhanced specificity.
2006
In an attempt to develop a novel strategy for the identification of new members of protein families by in silico approaches, we have developed a semi-automated procedure of consecutive PSI-BLAST (Position-Specific-Iterated Basic Local Alignment Search Tool) searches incorporating identificiation as well as subsequent validation of putative candidates. For a proof of concept study we chose the search for novel members of the claudin family. The initial step was an iterated PSI-BLAST search starting with the PMP22_Claudin domain of each known member of the claudin family against the human part of the RefSeq Database. Putative new claudin domains derived from the converged list were evaluated …