Search results for "Component analysis"
showing 10 items of 562 documents
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut
2013
Abstract Background The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. Results We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different ass…
Learning non-linear time-scales with kernel -filters
2009
A family of kernel methods, based on the @c-filter structure, is presented for non-linear system identification and time series prediction. The kernel trick allows us to develop the natural non-linear extension of the (linear) support vector machine (SVM) @c-filter [G. Camps-Valls, M. Martinez-Ramon, J.L. Rojo-Alvarez, E. Soria-Olivas, Robust @c-filter using support vector machines, Neurocomput. J. 62(12) (2004) 493-499.], but this approach yields a rigid system model without non-linear cross relation between time-scales. Several functional analysis properties allow us to develop a full, principled family of kernel @c-filters. The improved performance in several application examples suggest…
Assessing the territorial influence of an Iberian worship site. The chemical characterisation of the terracotta from the Iron Age sanctuary of La Ser…
2017
This paper presents the study of the prestigious terracotta votive figurines from the Iberian Iron Age sanctuary of La Serreta (Alicante province, Spain) composed of 174 items. Portable X-ray fluorescence (PXRF) was used to identify elemental markers that permit us to observe the differences between local and non-local terracotta figurines and furthermore to evaluate the geographical influence of the La Serreta sanctuary using Principal Component Analysis (PCA). The Partial Least Squares Discriminant Analysis (PLSDA) statistical method was also used to classify the figurines of uncertain geographical origin. The resulting groups were related to typological and stylistic groups of figurines …
Syntagmatic and Paradigmatic Associations in Information Retrieval
2003
It is shown that unconscious associative processes taking place in the memory of a searcher during the formulation of a search query in information retrieval — such as the production of free word associations and the generation of synonyms — can be simulated using statistical models that analyze the distribution of words in large text corpora. The free word associations as produced by subjects on presentation of stimulus words can be predicted by applying first-order statistics to the frequencies of word co-occurrences as observed in texts. The generation of synonyms can also be conducted on co-occurrence data but requires second-order statistics. Both approaches are compared and validated …
Monitoring fire-affected areas using Thematic Mapper data
2001
In this paper three methods for updating inventories of burned areas have been presented and examined. They include Multitemporal Principal Component Analysis (MPCA), Change Vector Analysis (CVA) a...
MuLiMs-MCoMPAs: A Novel Multiplatform Framework to Compute Tensor Algebra-Based Three-Dimensional Protein Descriptors
2019
This report introduces the MuLiMs-MCoMPAs software (acronym for Multi-Linear Maps based on N-Metric and Contact Matrices of 3D Protein and Amino-acid weightings), designed to compute tensor-based 3D protein structural descriptors by applying two- and three-linear algebraic forms. Moreover, these descriptors contemplate generalizing components such as novel 3D protein structural representations, (dis)similarity metrics, and multimetrics to extract geometrical related information between two and three amino acids, weighting schemes based on amino acid properties, matrix normalization procedures that consider simple-stochastic and mutual probability transformations, topological and geometrical…
A novel dynamic multi-model relevance feedback procedure for content-based image retrieval
2016
This paper deals with the problem of image retrieval in large databases with a big semantic gap by a relevance feedback procedure. We present a novel algorithm for modelling the users's preferences in the content-based image retrieval system.The proposed algorithm considers the probability of an image belonging to the set of those sought by the user, and estimates the parameters of several local logistic regression models whose inputs are the low-level image features. A Principal Component Analysis method is applied to the original vector to reduce its high dimensionality. The relevance probabilities predicted by these local models are combined by means of a weighted average. These weights …
A rapid method for the differentiation of yeast cells grown under carbon and nitrogen-limited conditions by means of partial least squares discrimina…
2012
This paper shows the ease of application and usefulness of mid-IR measurements for the investigation of orthogonal cell states on the example of the analysis of Pichia pastoris cells. A rapid method for the discrimination of entire yeast cells grown under carbon and nitrogen-limited conditions based on the direct acquisition of mid-IR spectra and partial least squares discriminant analysis (PLS-DA) is described. The obtained PLS-DA model was extensively validated employing two different validation strategies: (i) statistical validation employing a method based on permutation testing and (ii) external validation splitting the available data into two independent sub-sets. The Variable Importa…
Time Trends in the Joint Distributions of Income and Age
2001
We propose a method of analyzing time changes of joint income-age densities. Change is decomposed into time invariant components which act on the densities as deformations with time varying strength. The functional form of these components is estimated non parametrically from cross sectional data. The method is applied to analyze British household data on income and age for the years 1968–95. It is learned that for the young and middle aged there is a trend towards increasing inequality, while during the early eighties there seems to occur a reversal in the evolution of the income distribution for the old.
Suspended particulate matter fluxes along with their associated metals, organic matter and carbonates in a coastal Mediterranean area affected by min…
2016
International audience; A study of suspended particulate matter (SPM) fluxes along with their associated metals, organic matter and carbonates, was conducted off the Mejerda River outlet in May 2011 and in March and July 2012 at depths of 10, 20 and 40 m using sediment traps. SPM fluxes are more significant near the Mejerda outlet, especially in winter, but dissipate further offshore. Normalization reveals that the Mejerda is a major source of Pb, Zn, Cd, Cu, Ni, and Co, all of which are the result of human activities. In contrast, Fe, Mn and N are of authigenic origin. The enrichment factor shows that Pb, Zn and especially Cd are the most highly polluting metals off the Mejerda outlet. Thi…