Search results for "Genomics"
showing 10 items of 1255 documents
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing
2016
Background In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. Results In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were design…
2017
Hemipteran insects are well-known in their ability to establish symbiotic relationships with bacteria. Among them, heteropteran insects present an array of symbiotic systems, ranging from the most common gut crypt symbiosis to the more restricted bacteriome-associated endosymbiosis, which have only been detected in members of the superfamily Lygaeoidea and the family Cimicidae so far. Genomic data of heteropteran endosymbionts are scarce and have merely been analyzed from the Wolbachia endosymbiont in bed bug and a few gut crypt-associated symbionts in pentatomoid bugs. In this study, we present the first detailed genomic analysis of a bacteriome-associated endosymbiont of a phytophagous he…
A continuous genome assembly of the corkwing wrasse (Symphodus melops)
2018
Material suplementar disponível online em: https://doi.org/10.1016/j.ygeno.2018.04.009. The wrasses (Labridae) are one of the most successful and species-rich families of the Perciformes order of teleost fish. Its members display great morphological diversity, and occupy distinct trophic levels in coastal waters and coral reefs. The cleaning behaviour displayed by some wrasses, such as corkwing wrasse (Symphodus melops), is of particular interest for the salmon aquaculture industry to combat and control sea lice infestation as an alternative to chemicals and pharmaceuticals. There are still few genome assemblies available within this fish family for comparative and functional studies, despi…
AlkAniline-Seq: Profiling of m7 G and m3 C RNA Modifications at Single Nucleotide Resolution.
2018
RNA modifications play essential roles in gene expression regulation. Only seven out of >150 known RNA modifications are detectable transcriptome-wide by deep sequencing. Here we describe a new principle of RNAseq library preparation, which relies on a chemistry based positive enrichment of reads in the resulting libraries, and therefore leads to unprecedented signal-to-noise ratios. The proposed approach eschews conventional RNA sequencing chemistry and rather exploits the generation of abasic sites and subsequent aniline cleavage. The newly generated 5'-phosphates are used as unique entry for ligation of an adapter in library preparation. This positive selection, embodied in the AlkAnilin…
Applying Conceptual Modeling to Better Understand the Human Genome
2016
The objective of the work is to present the benefits of the application of Conceptual Modeling (CM) in complex domains, such as genomics. This paper explains the evolution of a Conceptual Schema of the Human Genome (CSHG), which seeks to provide a clear and precise understanding of the human genome. We want to highlighting all the advantages of the application of CM in a complex domain such as Genomic Information Systems (GeIS). We show how over time this model has evolved, thus we have discovered better forms of representation. As we advanced in exploring the domain, we understood that we should be extending and incorporating the new concepts detected into our model. Here we present and di…
Differential binding cell-SELEX method to identify cell-specific aptamers using high-throughput sequencing
2018
AbstractAptamers have in recent years emerged as a viable alternative to antibodies. High-throughput sequencing (HTS) has revolutionized aptamer research by increasing the number of reads from a few (using Sanger sequencing) to millions (using an HTS approach). Despite the availability and advantages of HTS compared to Sanger sequencing, there are only 50 aptamer HTS sequencing samples available on public databases. HTS data in aptamer research are primarily used to compare sequence enrichment between subsequent selection cycles. This approach does not take full advantage of HTS because the enrichment of sequences during selection can be due to inefficient negative selection when using live…
Next-generation sequencing: big data meets high performance computing
2017
The progress of next-generation sequencing has a major impact on medical and genomic research. This high-throughput technology can now produce billions of short DNA or RNA fragments in excess of a few terabytes of data in a single run. This leads to massive datasets used by a wide range of applications including personalized cancer treatment and precision medicine. In addition to the hugely increased throughput, the cost of using high-throughput technologies has been dramatically decreasing. A low sequencing cost of around US$1000 per genome has now rendered large population-scale projects feasible. However, to make effective use of the produced data, the design of big data algorithms and t…
Data mining approaches to identify biomineralization related sequences.
2015
Proteomics is an efficient high throughput technique developed to identify proteins from a crude extract using sequence homology. Advances in Next Generation Sequencing (NGS) have led to increase knowledge of several non-model species. In the field of calcium carbonate biomineralization, the paucity of available sequences (such as the ones of mollusc shells) is still a bottleneck in most proteomic studies. Indeed, this technique needs proteins databases to find homology. The aim of this study was to perform different data mining approaches in order to identify novel shell proteins. To this end, we disposed of several publicly non-model molluscs databases. Previously identified molluscan she…
Transcriptome Analysis of PA Gain and Loss of Function Mutants
2017
Functional genomics has become a forefront methodology for plant science thanks to the widespread development of microarray technology. While technical difficulties associated with the process of obtaining raw expression data have been diminishing, allowing the appearance of tremendous amounts of transcriptome data in different databases, a common problem using "omic" technologies remains: the interpretation of these data and the inference of its biological meaning. In order to assist to this complex task, a wide variety of software tools have been developed. In this chapter we describe our current workflow of the application of some of these analyses. We have used it to compare the transcr…
SpCLUST: Towards a fast and reliable clustering for potentially divergent biological sequences
2019
International audience; This paper presents SpCLUST, a new C++ package that takes a list of sequences as input, aligns them with MUSCLE, computes their similarity matrix in parallel and then performs the clustering. SpCLUST extends a previously released software by integrating additional scoring matrices which enables it to cover the clustering of amino-acid sequences. The similarity matrix is now computed in parallel according to the master/slave distributed architecture, using MPI. Performance analysis, realized on two real datasets of 100 nucleotide sequences and 1049 amino-acids ones, show that the resulting library substantially outperforms the original Python package. The proposed pac…