Search results for "software"
showing 10 items of 7396 documents
Boosting Signal-to-Noise in Complex Biology: Prior Knowledge Is Power
2011
A major difficulty in the analysis of complex biological systems is dealing with the low signal-to-noise inherent to nearly all large biological datasets. We discuss powerful bioinformatic concepts for boosting signal-to-noise through external knowledge incorporated in processing units we call filters and integrators. These concepts are illustrated in four landmark studies that have provided model implementations of filters, integrators, or both.
Low-cost scalable discretization, prediction and feature selection for complex systems
2019
The introduced data-driven tool allows simultaneous feature selection, model inference, and marked cost and quality gains.
Comments from Pascal Schlich on the Steinsholt's paper
1998
International audience
SORT-CC: A procedure for the statistical treatment of free sorting data
2008
International audience; A statistical approach for the analysis of free sorting data is discussed. In a first stage, the sorting data from each subject are arranged into a dataset consisting of indicator variables which reflect the memberships of the stimuli to the groups formed by the subject under consideration. Thereafter, an appropriate standardization is applied on these data and a three way statistical method, namely Common Components and Specific Weights Analysis, is performed on the datasets thus obtained. This makes it possible to take account of the individual differences among the subjects and to depict graphical displays showing the relationships among the stimuli on the one han…
The Software Crisis of Synthetic Biology
2016
In fifteen years, Synthetic Biology (SB) has moved from proof-of-concept designs to several flagship achievements. Standardisation efforts are still under way, basic engineering concepts such as modularity and orthogonality are still controversial in biology, and making predictions from computer models is still unreliable. A deep characterization in the pattern of re-use of biological blocks in SB has not been attempted to date. We have compared the topological organisation of two different technological networks, one associated to a standard, large-scale software repository and the second provided by the Registry of Standard Biological Parts (RSBP). Our results strongly suggest that softwa…
Hyperion
2019
Indexes are essential in data management systems to increase the speed of data retrievals. Widespread data structures to provide fast and memory-efficient indexes are prefix tries. Implementations like Judy, ART, or HOT optimize their internal alignments for cache and vector unit efficiency. While these measures usually improve the performance substantially, they can have a negative impact on memory efficiency. In this paper we present Hyperion, a trie-based main-memory key-value store achieving extreme space efficiency. In contrast to other data structures, Hyperion does not depend on CPU vector units, but scans the data structure linearly. Combined with a custom memory allocator, Hyperion…
Main Steps in Image Processing and Quantification: The Analysis Workflow
2019
In the last decades, the variety of programs, algorithms, and strategies that researchers have at their disposal to process and analyze image files has grown extensively. However, these are only pointless tools if not applied with the careful planning required to achieve a succesful image analysis. In order to do so, the analyst must establish a meaningful and effective sequence of orderly operations that is able to (1) overcome all the problems derived from the image manipulation and (2) successfully resolve the question that was originally posed. In this chapter, the authors suggest a set of strategies and present a reflection on the main milestones that compose the image processing workf…
Correction: The landscape of epilepsy-related GATOR1 variants
2019
International audience; The original version of this article contained an error in the spelling of the author Erik H. Niks, which was incorrectly given as Erik Niks. This has now been corrected in both the PDF and HTML versions of the article.
EHRtemporalVariability: delineating temporal dataset shifts in electronic health records
2020
AbstractBackgroundTemporal variability in healthcare processes or protocols is intrinsic to medicine. Such variability can potentially introduce dataset shifts, a data quality issue when reusing electronic health records (EHRs) for secondary purposes. Temporal dataset shifts can present as trends, abrupt or seasonal changes in the statistical distributions of data over time, being particularly complex to address in multi-modal and highly coded data. These changes, if not delineated, can harm population and data-driven research, such as machine learning. Given that biomedical research repositories are increasingly being populated with large historical data from EHRs, there is a need for spec…
Visualizing Human Protein‐Protein Interactions and Subcellular Localizations on Cell Images Through CellMap
2020
Visualizing protein data remains a challenging and stimulating task. Useful and intuitive visualization tools may help advance biomolecular and medical research; unintuitive tools may bar important breakthroughs. This protocol describes two use cases for the CellMap (http://cellmap.protein.properties) web tool. The tool allows researchers to visualize human protein-protein interaction data constrained by protein subcellular localizations. In the simplest form, proteins are visualized on cell images that also show protein-protein interactions (PPIs) through lines (edges) connecting the proteins across the compartments. At a glance, this simultaneously highlights spatial constraints that prot…