Search results for "artificial intelligence"
showing 10 items of 6122 documents
Semisupervised kernel orthonormalized partial least squares
2012
This paper presents a semisupervised kernel orthonormalized partial least squares (SS-KOPLS) algorithm for non-linear feature extraction. The proposed method finds projections that minimize the least squares regression error in Hilbert spaces and incorporates the wealth of unlabeled information to deal with small size labeled datasets. The method relies on combining a standard RBF kernel using labeled information, and a generative kernel learned by clustering all available data. The positive definiteness of the kernels is proven, and the structure and information content of the derived kernels is studied. The effectiveness of the proposed method is successfully illustrated in standard UCI d…
Statistical methods for texture analysis applied to agronomical images
2008
For activities of agronomical research institute, the land experimentations are essential and provide relevant information on crops such as disease rate, yield components, weed rate... Generally accurate, they are manually done and present numerous drawbacks, such as penibility, notably for wheat ear counting. In this case, the use of color and/or texture image processing to estimate the number of ears per square metre can be an improvement. Then, different image segmentation techniques based on feature extraction have been tested using textural information with first and higher order statistical methods. The Run Length method gives the best results closed to manual countings with an averag…
Semisupervised Kernel Feature Extraction for Remote Sensing Image Analysis
2014
This paper presents a novel semisupervised kernel partial least squares (KPLS) algorithm for nonlinear feature extraction to tackle both land-cover classification and biophysical parameter retrieval problems. The proposed method finds projections of the original input data that align with the target variable (labels) and incorporates the wealth of unlabeled information to deal with low-sized or underrepresented data sets. The method relies on combining two kernel functions: the standard radial-basis-function kernel based on labeled information and a generative, i.e., probabilistic, kernel directly learned by clustering the data many times and at different scales across the data manifold. Th…
Merging the transform step and the quantization step for Karhunen-Loeve transform based image compression
2000
Transform coding is one of the most important methods for lossy image compression. The optimum linear transform - known as Karhunen-Loeve transform (KLT) - was difficult to implement in the classic way. Now, due to continuous improvements in neural network's performance, the KLT method becomes more topical then ever. We propose a new scheme where the quantization step is merged together with the transform step during the learning phase. The new method is tested for different levels of quantization and for different types of quantizers. Experimental results presented in the paper prove that the new proposed scheme always gives better results than the state-of-the-art solution.
Noise Robustness Analysis of Point Cloud Descriptors
2013
In this paper, we investigate the effect of noise on 3D point cloud descriptors. Various types of point cloud descriptors have been introduced in the recent years due to advances in computing power, which makes processing point cloud data more feasible. Most of these descriptors describe the orientation difference between pairs of 3D points in the object and represent these differences in a histogram. Earlier studies dealt with the performances of different point cloud descriptors; however, no study has ever discussed the effect of noise on the descriptors performances. This paper presents a comparison of performance for nine different local and global descriptors amidst 10 varying levels o…
Quality based classification of gasoline samples by ATR-FTIR spectrometry using spectral feature selection with quadratic discriminant analysis
2013
Abstract A chemometric approach has been developed for characterization of gasoline samples regarding their quality. Attenuated total reflectance – infrared spectrometric data were processed by genetic algorithm (GA) and successive projection algorithm (SPA) feature selection techniques, being employed as an initial step prior to apply a discriminative tool. It was aimed to classify the fuel samples according to their quality passed/failed data. Chemometric predictive procedures were developed using quadratic discriminant analysis (QDA) combined with GA and SPA as a feature subset and feature selection strategy. Results showed 93.3% and 95.6% accuracy for SPA-QDA and GA-QDA models respectiv…
Some applications of a theorem of Shirshov to language theory
1983
Some applications of a theorem of Shirshov to language theory are given: characterization of regular languages, characterization of bounded languages, and a sufficient condition for a language to be Parikh-bounded.
Multi-dimensional pattern matching with dimensional wildcards
1995
We introduce a new multi-dimensional pattern matching problem, which is a natural generalization of the on-line search in string matching. We are given a text matrix A[1: n1, ..., 1:n d ] of size N= n1×n2×...×n d , which we may preprocess. Then, we are given, online, an r-dimensional pattern matrix B[1:m1,...,1:m r ] of size M= m1×m2×...×m r , with 1≤r≤d. We would like to know whether B*=B*[*, 1:m1,*, ...,1: mr, *] occurs in A, where * is a dimensional wildcard such that B* is any d-dimensional matrix having size 1 × ... × m1×...1×m r ×...1 and containing the same elements as B. Notice that there might be (d/r)≤2d occurrences of B* for each position of A. We give CRCW-PRAM algorithms for pr…
A generalizability measure for program synthesis with genetic programming
2021
The generalizability of programs synthesized by genetic programming (GP) to unseen test cases is one of the main challenges of GP-based program synthesis. Recent work showed that increasing the amount of training data improves the generalizability of the programs synthesized by GP. However, generating training data is usually an expensive task as the output value for every training case must be calculated manually by the user. Therefore, this work suggests an approximation of the expected generalization ability of solution candidates found by GP. To obtain candidate solutions that all solve the training cases, but are structurally different, a GP run is not stopped after the first solution …
The new world of human genetic technologies: The policy environment and impacts of genetic screening tests
1995
Today it is possible to screen for mutated DNA sequences which do not induce any diseases but predispose to develop diseases under certain environmental condition. These latter disorders are called “multifactorial” since they result from the interplay of genetic and environmental factors. Among multifactorial disorders there are job-related diseases whose genetic component can be identified by genetic screening tests. The use of these tests to predict occupational disorders, to cut down on them, and to save costs—in particular for absenteeism, health care, and lawsuits—is of interest to employers and insurers. As for employees, it could entail an extremely deep invasion of privacy, economic…