Search results for "Data Science"
showing 10 items of 495 documents
EvalMSA: A Program to Evaluate Multiple Sequence Alignments and Detect Outliers
2016
8 páginas, 3 figuras, 2 tablas.
Making sense of big data in health research: {T}owards an {EU} action plan
2016
Genome medicine 8(1), 71 (2016). doi:10.1186/s13073-016-0323-y
The Hidden Charm of Life
2019
Synthetic biology is an engineering view on biotechnology, which has revolutionized genetic engineering. The field has seen a constant development of metaphors that tend to highlight the similarities of cells with machines. I argue here that living organisms, particularly bacterial cells, are not machine-like, engineerable entities, but, instead, factory-like complex systems shaped by evolution. A change of the comparative paradigm in synthetic biology from machines to factories, from hardware to software, and from informatics to economy is discussed.
Applying Conceptual Modeling to Better Understand the Human Genome
2016
The objective of the work is to present the benefits of the application of Conceptual Modeling (CM) in complex domains, such as genomics. This paper explains the evolution of a Conceptual Schema of the Human Genome (CSHG), which seeks to provide a clear and precise understanding of the human genome. We want to highlighting all the advantages of the application of CM in a complex domain such as Genomic Information Systems (GeIS). We show how over time this model has evolved, thus we have discovered better forms of representation. As we advanced in exploring the domain, we understood that we should be extending and incorporating the new concepts detected into our model. Here we present and di…
HPG pore: an efficient and scalable framework for nanopore sequencing data.
2016
The use of nanopore technologies is expected to spread in the future because they are portable and can sequence long fragments of DNA molecules without prior amplification. The first nanopore sequencer available, the MinION™ from Oxford Nanopore Technologies, is a USB-connected, portable device that allows real-time DNA analysis. In addition, other new instruments are expected to be released soon, which promise to outperform the current short-read technologies in terms of throughput. Despite the flood of data expected from this technology, the data analysis solutions currently available are only designed to manage small projects and are not scalable. Here we present HPG Pore, a toolkit for …
Next-generation sequencing: big data meets high performance computing
2017
The progress of next-generation sequencing has a major impact on medical and genomic research. This high-throughput technology can now produce billions of short DNA or RNA fragments in excess of a few terabytes of data in a single run. This leads to massive datasets used by a wide range of applications including personalized cancer treatment and precision medicine. In addition to the hugely increased throughput, the cost of using high-throughput technologies has been dramatically decreasing. A low sequencing cost of around US$1000 per genome has now rendered large population-scale projects feasible. However, to make effective use of the produced data, the design of big data algorithms and t…
Network Analysis: Ten Years Shining Light on Host–Parasite Interactions
2020
Biological interactions are key drivers of ecological and evolutionary processes. The complexity of such interactions hinders our understanding of ecological systems and our ability to make effective predictions in changing environments. However, network analysis allows us to better tackle the complexity of ecosystems because it extracts the properties of an ecological system according to the number and distribution of links among interacting entities. The number of studies using network analysis to solve ecological and evolutionary questions in parasitology has increased over the past decade. Here, we synthesise the contribution of network analysis toward disentangling host-parasite proces…
Guidelines for the use of flow cytometry and cell sorting in immunological studies (second edition)
2019
All authors: Andrea Cossarizza Hyun‐Dong Chang Andreas Radbruch Andreas Acs Dieter Adam Sabine Adam‐Klages William W. Agace Nima Aghaeepour Mübeccel Akdis Matthieu Allez Larissa Nogueira Almeida Giorgia Alvisi Graham Anderson Immanuel Andrä Francesco Annunziato Achille Anselmo Petra Bacher Cosima T. Baldari Sudipto Bari Vincenzo Barnaba Joana Barros‐Martins Luca Battistini Wolfgang Bauer Sabine Baumgart Nicole Baumgarth Dirk Baumjohann Bianka Baying Mary Bebawy Burkhard Becher Wolfgang Beisker Vladimir Benes Rudi Beyaert Alfonso Blanco Dominic A. Boardman Christian Bogdan Jessica G. Borger Giovanna Borsellino Philip E. Boulais Jolene A. Bradford Dirk Brenner Ryan R. Brinkman Anna E. S. Broo…
Perspective: Essential study quality descriptors for data from nutritional epidemiologic research
2017
Pooled analysis of secondary data increases the power of research and enables scientific discovery in nutritional epidemiology. Information on study characteristics that determine data quality is needed to enable correct reuse and interpretation of data. This study aims to define essential quality characteristics for data from observational studies in nutrition. First, a literature review was performed to get an insight on existing instruments that assess the quality of cohort, case-control, and cross-sectional studies and dietary measurement. Second, 2 face-to-face workshops were organized to determine the study characteristics that affect data quality. Third, consensus on the data descrip…
A REST-based framework to support non-invasive and early coeliac disease diagnosis
2019
The health sector has traditionally been one of the early adopters of databases, from the most simple Electronic Health Record (formerly Computer-Based Patient Record) systems in use in general practice, hospitals and intensive care units to big data, multidata based systems used to support diagnosis and care decisions. In this paper we present a framework to support non-invasive and early diagnosis of coeliac disease. The proposed framework makes use of well-known technologies and techniques, both hardware and software, put together in a novel way. The main goals of our framework are: (1) providing users with a reliable and fast repository of a large amount of data; (2) to make such reposi…