Search results for "biological data"
showing 10 items of 53 documents
openSNP–A Crowdsourced Web Resource for Personal Genomics
2014
Genome-Wide Association Studies are widely used to correlate phenotypic traits with genetic variants. These studies usually compare the genetic variation between two groups to single out certain Single Nucleotide Polymorphisms (SNPs) that are linked to a phenotypic variation in one of the groups. However, a sufficiently large sample size is needed to find statistically significant correlations. Direct-To-Consumer (DTC) genetic testing can supply additional data: DTC companies offer the analysis of a large number of SNPs for an individual at low cost without the need to consult a physician or geneticist. Over 100,000 people have already been genotyped through Direct-To-Consumer genet…
Compression-based classification of biological sequences and structures via the Universal Similarity Metric: experimental assessment.
2007
Background: Similarity of sequences is a key mathematical notion for classification and phylogenetic studies in biology. It is currently handled primarily using alignments. However, alignment methods seem inadequate for post-genomic studies, since they do not scale well with data set size and seem to be confined to genomic and proteomic sequences. Therefore, alignment-free similarity measures are actively pursued. Among those, USM (Universal Similarity Metric) has gained prominence. It is based on the deep theory of Kolmogorov Complexity, and universality is its most novel and striking feature. Since it can only be approximated via data compression, USM is a methodology rath…
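The abstract above notes that USM can only be approximated via data compression. A minimal sketch of one widely used approximation of this kind, the Normalized Compression Distance, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), follows. The choice of zlib as the compressor C and the function names are assumptions made for illustration; they are not taken from the paper.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (stand-in for C(x))."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized Compression Distance between two byte strings.

    Values near 0 suggest the inputs share most of their information;
    values near 1 suggest they share little. zlib is an assumed compressor,
    and results depend on which compressor is used.
    """
    cx = compressed_size(x)
    cy = compressed_size(y)
    cxy = compressed_size(x + y)  # compress the concatenation
    return (cxy - min(cx, cy)) / max(cx, cy)
```

With a real compressor the value is only an estimate: it is not guaranteed to be exactly 0 for identical inputs, and zlib's 32 KB window limits how well it detects similarity between long sequences.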
Identification of biological targets through the correlation between cell line chemosensitivity and protein expression pattern.
2021
Matching biological data sequences is one of the most interesting ways to discover new bioactive compounds. In particular, matching cell chemosensitivity with a protein expression profile can be a useful approach to predict the activity of compounds against definite biological targets. In this review, we discuss this correlation. First, we analyze case studies in which some known drugs, acting on known targets, show a good correlation between their antiproliferative activities and protein expression when a large panel of tumor cells is considered. Then, we highlight how the application of in silico methods based on the correlation between cell line chemosensitivity and gene/protein expressi…
Characterization of Bacillus thuringiensis isolates by their insecticidal activity and their production of Cry and Vip3 proteins.
2018
WOS: 000449027600099. PubMed ID: 30383811. Bacillus thuringiensis (Bt) constitutes the active ingredient of many successful bioinsecticides used in agriculture. In the present study, the genetic diversity and toxicity of Bt isolates were investigated by characterizing native isolates from a Turkish collection, originating from soil, fig leaves and fruits. Of a total of 80 Bt isolates, 18 (23%) were found to carry a vip3 gene and were selected for further study. Insecticidal activity of spore/crystal mixtures and their supernatants showed that some of the Bt isolates were significantly more toxic to some lepidopteran species than the HD1 reference strain. Five isolate…
Palaeogenetics and cultural heritage. Species determination and STR-genotyping from ancient DNA in art and artefacts
2000
In recent years, a few papers have addressed the palaeogenetic analysis of cultural, historical and archaeological artefacts. We provide an overview of the individual published articles and then describe the results we obtained in a palaeogenetic research project involving various historical and prehistoric finds from museums, archaeological excavations, and libraries. We show that ancient DNA can be isolated from most of the various biomaterials (leather, parchment, glue, binding media, crusted organic plant remains in containers). Short pieces of degraded DNA are used, on the one hand, to determine the organic remnant’s genus/species of origin, and on the other hand, …
BIOfid dataset: publishing a German gold standard for named entity recognition in historical biodiversity literature
2019
The Specialized Information Service Biodiversity Research (BIOfid) has been launched to mobilize valuable biological data from printed literature hidden in German libraries for over the past 250 years. In this project, we annotate German texts converted by OCR from historical scientific literature on the biodiversity of plants, birds, moths and butterflies. Our work enables the automatic extraction of biological information previously buried in the mass of papers and volumes. For this purpose, we generated training data for the tasks of Named Entity Recognition (NER) and Taxa Recognition (TR) in biological documents. We use this data to train a number of leading machine learning tools and c…
CiliaCarta: An integrated and validated compendium of ciliary genes
2019
The cilium is an essential organelle at the surface of mammalian cells whose dysfunction causes a wide range of genetic diseases collectively called ciliopathies. The current rate at which new ciliopathy genes are identified suggests that many ciliary components remain undiscovered. We generated and rigorously analyzed genomic, proteomic, transcriptomic and evolutionary data and systematically integrated these using Bayesian statistics into a predictive score for ciliary function. This resulted in 285 candidate ciliary genes. We generated independent experimental evidence of ciliary associations for 24 out of 36 analyzed candidate proteins using multiple cell and animal model systems (mouse…
Three-dimensional cardiac computational modelling: methods, features and applications
2015
The combination of computational models and biophysical simulations can help to interpret an array of experimental data and contribute to the understanding, diagnosis and treatment of complex diseases such as cardiac arrhythmias. For this reason, three-dimensional (3D) cardiac computational modelling is currently a rising field of research. The advance of medical imaging technology over the last decades has allowed the evolution from generic to patient-specific 3D cardiac models that faithfully represent the anatomy and different cardiac features of a given living subject. Here we analyse sixty representative 3D cardiac computational models developed and published during the last fifty …
An Integrative Framework for the Construction of Big Functional Networks
2018
We present a methodology for biological data integration, aiming at building and analysing large functional networks which model complex genotype-phenotype associations. A functional network is a graph where nodes represent cellular components (e.g., genes, proteins, mRNA, etc.) and edges represent associations among such molecules. Different types of components may coexist in the same network, and associations may be related to physical/biochemical interactions or functional/phenotypic relationships. Due to both the large amount of involved information and the computational complexity typical of the problems in this domain, the proposed framework is based on big data technologies (Spark a…
Textual data compression in computational biology: Algorithmic techniques
2012
In a recent review [R. Giancarlo, D. Scaturro, F. Utro, Textual data compression in computational biology: a synopsis, Bioinformatics 25 (2009) 1575–1586] the first systematic organization and presentation of the impact of textual data compression on the analysis of biological data was given. Its main focus was on a systematic presentation of the key areas of bioinformatics and computational biology where compression has been used, together with a technical presentation of how well-known notions from information theory have been adapted to work successfully on biological data. Rather surprisingly, the use of data compression is pervasive in computational biology. Starting from…