Towards interpretable classifiers with blind signal separation
Blind signal separation (BSS) is a powerful tool to open-up complex signals into component sources that are often interpretable. However, BSS methods are generally unsupervised, therefore the assignment of class membership from the elements of the mixing matrix may be sub-optimal. This paper proposes a three-stage approach using Fisher information metric to define a natural metric for the data, from which a Euclidean approximation can then be used to drive BSS. Results with synthetic data models of real-world high-dimensional data show that the classification accuracy of the method is good for challenging problems, while retaining interpretability.
Data Mining in Cancer Research [Application Notes
This article is not intended as a comprehensive survey of data mining applications in cancer. Rather, it provides starting points for further, more targeted, literature searches, by embarking on a guided tour of computational intelligence applications in cancer medicine, structured in increasing order of the physical scales of biological processes.
A Novel Semi-Supervised Methodology for Extracting Tumor Type-Specific MRS Sources in Human Brain Data
Background: The clinical investigation of human brain tumors often starts with a non-invasive imaging study, providing \ud information about the tumor extent and location, but little insight into the biochemistry of the analyzed tissue. Magnetic \ud Resonance Spectroscopy can complement imaging by supplying a metabolic fingerprint of the tissue. This study analyses \ud single-voxel magnetic resonance spectra, which represent signal information in the frequency domain. Given that a single \ud voxel may contain a heterogeneous mix of tissues, signal source identification is a relevant challenge for the problem of\ud tumor type classification from the spectroscopic signal.\ud Methodology/Princ…
Making nonlinear manifold learning models interpretable: The manifold grand tour
Smooth nonlinear topographic maps of the data distribution to guide a Grand Tour visualisation.Prioritisation of data linear views that are most consistent with data structure in the maps.Useful visualisations that cannot be obtained by other more classical approaches. Dimensionality reduction is required to produce visualisations of high dimensional data. In this framework, one of the most straightforward approaches to visualising high dimensional data is based on reducing complexity and applying linear projections while tumbling the projection axes in a defined sequence which generates a Grand Tour of the data. We propose using smooth nonlinear topographic maps of the data distribution to…