Search results for "Data set"
showing 10 items of 154 documents
A method to reduce the FP/imm number through CC and MLO views comparison in mammographic images
2008
In this paper we propose a method to reduce the FP/imm number by comparing the CC and MLO mammographic views of the same patient. The proposed solution uses the symmetry properties of the breast to compute a geometric transformation that allows the two images to be represented in comparable coordinate systems. Through this method, potential pathological ROIs in one projection are correlated with the ROIs in the second view. To show the effectiveness of the method, we apply it to a dataset of 112 pairs of pathological images. Experiments show that the method enables a reduction of up to 70% in the FP/imm number detected after the classification step.
A practical approach to improve the statistical performance of surface water monitoring networks
2019
The representativeness of aquatic ecosystem monitoring and the precision of the assessment results are of high importance when implementing the EU’s Water Framework Directive, which aims to secure a good status of waterbodies in Europe. However, monitoring designs are seldom adapted to the objectives, and sampling resources are seldom allocated effectively. Here, we present a practical solution for re-allocating the sampling effort without decreasing the precision and confidence of status class assignment. To demonstrate this, we used a large data set of 272 intensively monitored Finnish lake, coastal, and river waterbodies utilizing an existing framework for quantifying…
Multimodal Simulation of a Novel Device for a Safe and Effective External Ventricular Drain Placement
2021
Background: External ventricular drain (EVD) placement is mandatory for several pathologies. The misplacement rate of the EVD varies widely in the literature, ranging from 12.3 to 60%. The purpose of this simulation study is to provide preliminary data about the possibility of increasing the safety of one of the most common life-saving procedures in neurosurgery by testing a new device for EVD placement. Methods: We used a novel guide for positioning the ventricular catheter (patent RM2014A000376). The trajectory was assessed using 25 anonymized head CT scans. The data sets were used to conduct three-dimensional computer-based and combined navigation and augmented reality-based simulations using pla…
SMART: Unique splitting-while-merging framework for gene clustering
2014
© 2014 Fa et al. Successful clustering algorithms are highly dependent on parameter settings. The clustering performance degrades significantly unless parameters are properly set, and yet, it is difficult to set these parameters a priori. To address this issue, in this paper, we propose a unique splitting-while-merging clustering framework, named "splitting merging awareness tactics" (SMART), which does not require any a priori knowledge of either the number …
Bayesian versus data driven model selection for microarray data
2014
Clustering is one of the most well known activities in scientific investigation and the object of research in many disciplines, ranging from Statistics to Computer Science. In this beautiful area, one of the most difficult challenges is a particular instance of the model selection problem, i.e., the identification of the correct number of clusters in a dataset. In what follows, for ease of reference, we refer to that instance still as model selection. It is an important part of any statistical analysis. The techniques used for solving it are mainly either Bayesian or data-driven, and are both based on internal knowledge. That is, they use information obtained by processing the input data. A…
Optimizing Kernel Ridge Regression for Remote Sensing Problems
2018
Kernel methods have been very successful in remote sensing problems because of their ability to deal with high-dimensional non-linear data. However, they are computationally expensive to train when a large number of samples is used. In this context, while the amount of available remote sensing data has constantly increased, the size of training sets in kernel methods is usually restricted to a few thousand samples. In this work, we modified the kernel ridge regression (KRR) training procedure to deal with large-scale datasets. In addition, the basis functions in the reproducing kernel Hilbert space are defined as parameters to be also optimized during the training process. This extends the n…
Compression-based classification of biological sequences and structures via the Universal Similarity Metric: experimental assessment.
2007
Background: Similarity of sequences is a key mathematical notion for Classification and Phylogenetic studies in Biology. It is currently primarily handled using alignments. However, the alignment methods seem inadequate for post-genomic studies since they do not scale well with data set size and they seem to be confined only to genomic and proteomic sequences. Therefore, alignment-free similarity measures are actively pursued. Among those, USM (Universal Similarity Metric) has gained prominence. It is based on the deep theory of Kolmogorov Complexity, and universality is its most novel and striking feature. Since it can only be approximated via data compression, USM is a methodology rath…
Querying and reasoning over large scale building data sets
2016
The architectural design and construction domains work on a daily basis with massive amounts of data. Properly managing, exchanging and exploiting these data is an ever ongoing challenge in this domain. This has resulted in large semantic RDF graphs that are to be combined with a significant number of other data sets (building product catalogues, regulation data, geometric point cloud data, simulation data, sensor data), thus making an already huge dataset even larger. Making these big data available at high performance rates and speeds and into the correct (intuitive) formats is therefore an incredibly high challenge in this domain. Yet, hardly any benchmark is avai…
A framework for modelling the biomechanical behaviour of the human liver during breathing in real time using machine learning
2017
Progress in biomechanical modelling of human soft tissue is the basis for the development of new clinical applications capable of improving the diagnosis and treatment of some diseases (e.g. cancer), as well as the surgical planning and guidance of some interventions. The finite element method (FEM) is one of the most popular techniques used to predict the deformation of human soft tissue due to its high accuracy. However, FEM has a high associated computational cost, which makes it difficult to integrate into real-time computer-aided surgery systems. An alternative for simulating the mechanical behaviour of human organs in real time comes from the use of machine learning (ML) techniq…
Deep Learning-Based Real-Time Object Detection in Inland Navigation
2019
Semi-autonomous and fully-autonomous systems must have knowledge about the objects in their environment to ensure safe navigation. Modern approaches implement deep learning techniques to train a neural network for object detection. This project will study the effectiveness of using several promising algorithms such as Faster R-CNN, SSD, and different versions of YOLO, to detect, classify, and track objects in near real time in the fluvial domain. Since no dataset is available for this purpose in the literature, we started by annotating a dataset of 2488 images with almost 35 400 annotations for training the convolutional neural network architectures. We made this data s…