Search results for " mining"

showing 10 items of 1548 documents

X!TandemPipeline: a tool to manage sequence redundancy for protein inference and phosphosite identification

2017

X!TandemPipeline is a software designed to perform protein inference and to manage redundancy in the results of phosphosite identification by database search. It provides the minimal list of proteins or phosphosites that are present in a set of samples using grouping algorithms based on the principle of parsimony. Regarding proteins, a two-level classification is performed, where groups gather proteins sharing at least one peptide and subgroups gather proteins that are not distinguishable according to the identified peptides. Regarding phosphosites, an innovative approach based on the concept of phosphoisland is used to gather overlapping phosphopeptides. The graphical interface of X!Tandem…

0106 biological sciences0301 basic medicinePhosphopeptidesProteomicsphosphopeptideComputer sciencecomputer.internet_protocolcomputer.software_genre01 natural sciencesBiochemistrydatabase search03 medical and health sciencesSearch engineUser-Computer InterfaceRedundancy (information theory)SoftwareTandem Mass Spectrometry[ INFO.INFO-BI ] Computer Science [cs]/Bioinformatics [q-bio.QM]HumansDatabase search engineAmino Acid SequenceDatabases ProteinGraphical user interfacemass spectrometrybusiness.industrysoftwareprotein inferenceProteinsGeneral ChemistrybioinformaticsSearch EngineBenchmarking030104 developmental biologyComputingMethodologies_PATTERNRECOGNITIONProtein inferenceData mining[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]businesscomputerXMLAlgorithms010606 plant biology & botany
researchProduct

Model‐based approaches to unconstrained ordination

2014

Summary Unconstrained ordination is commonly used in ecology to visualize multivariate data, in particular, to visualize the main trends between different sites in terms of their species composition or relative abundance. Methods of unconstrained ordination currently used, such as non-metric multidimensional scaling, are algorithm-based techniques developed and implemented without directly accommodating the statistical properties of the data at hand. Failure to account for these key data properties can lead to misleading results. A model-based approach to unconstrained ordination can address this issue, and in this study, two types of models for ordination are proposed based on finite mixtu…

0106 biological sciencesComputer science010604 marine biology & hydrobiologyEcological ModelingModel selectionLatent variableMixture modelcomputer.software_genre010603 evolutionary biology01 natural sciencesData typeStatistical inferenceOrdinationMultidimensional scalingData miningLatent variable modelcomputerEcology Evolution Behavior and SystematicsMethods in Ecology and Evolution
researchProduct

A Methodology to Derive Global Maps of Leaf Traits Using Remote Sensing and Climate Data

2018

This paper introduces a modular processing chain to derive global high-resolution maps of leaf traits. In particular, we present global maps at 500 m resolution of specific leaf area, leaf dry matter content, leaf nitrogen and phosphorus content per dry mass, and leaf nitrogen/phosphorus ratio. The processing chain exploits machine learning techniques along with optical remote sensing data (MODIS/Landsat) and climate data for gap filling and up-scaling of in-situ measured leaf traits. The chain first uses random forests regression with surrogates to fill gaps in the database (> 45% of missing entries) and maximizes the global representativeness of the trait dataset. Plant species are then a…

0106 biological sciencesFOS: Computer and information sciences010504 meteorology & atmospheric sciencesSpecific leaf areaClimateBos- en LandschapsecologieSoil ScienceFOS: Physical sciencesApplied Physics (physics.app-ph)010603 evolutionary biology01 natural sciencesStatistics - ApplicationsGoodness of fitAbundance (ecology)Machine learningForest and Landscape EcologyApplications (stat.AP)Computers in Earth SciencesPlant ecologyVegetatie0105 earth and related environmental sciencesRemote sensingMathematics2. Zero hungerPlant traitsVegetationData stream miningClimate; Landsat; Machine learning; MODIS; Plant ecology; Plant traits; Random forests; Remote sensing; Soil Science; Geology; Computers in Earth SciencesGlobal MapRegression analysisGeologyPhysics - Applied Physics15. Life on landRandom forestsRemote sensingPE&RCRandom forestMODISTraitVegetatie Bos- en LandschapsecologieVegetation Forest and Landscape EcologyLandsat
researchProduct

Spatio-Temporal model structures with shared components for semi-continuous species distribution modelling

2017

Abstract Understanding the spatio-temporal dynamism and environmental relationships of species is essential for the conservation of natural resources. Many spatio-temporally sampled processes result in continuous positive [ 0 , ∞ ) abundance datasets that have many zero values observed in areas that lie outside their optimum niche. In such cases the most common option is to use two-part or hurdle models, which fit independent models and consequently independent environmental effects to occurrence and conditional-to-presence abundance. This may be correct in some cases, but not as much in others where the detection probability is related to the abundance. The aim of this work is to infer the…

0106 biological sciencesStatistics and ProbabilityProcess (engineering)Computer science010604 marine biology & hydrobiologyNicheManagement Monitoring Policy and Lawcomputer.software_genre01 natural sciencesNatural resourceEnvironmental niche modelling010104 statistics & probabilityAbundance (ecology)Component (UML)Data miningDynamism0101 mathematicsComputers in Earth SciencescomputerBayesian krigingSpatial Statistics
researchProduct

Tracking the outbreak. An optimized delimiting survey strategy for Xylella fastidiosa

2020

SummaryCurrent legislation enforces the implementation of intensive surveillance programs for quarantine plant pathogens. After an outbreak, surveys are implemented to delimit the geographic extent of the pathogen and execute disease control. The feasibility of control programs is highly dependent on budget availability, thus it is necessary to target and optimize surveillance strategies.A sequential adaptive delimiting survey involving a three-phase and a two-phase design with increasing spatial resolution was developed and implemented for the Xylella fastidiosa outbreak in Alicante, Spain. Inspection and sampling intensities were optimized using simulation-based methods and results were v…

0106 biological sciencesbiologyComputer scienceOutbreakSampling (statistics)computer.software_genrebiology.organism_classification010603 evolutionary biology01 natural sciencesDisease controlData miningXylella fastidiosacomputer010606 plant biology & botany
researchProduct

Rings for Privacy: an Architecture for Large Scale Privacy-Preserving Data Mining

2021

This article proposes a new architecture for privacy-preserving data mining based on Multi Party Computation (MPC) and secure sums. While traditional MPC approaches rely on a small number of aggregation peers replacing a centralized trusted entity, the current study puts forth a distributed solution that involves all data sources in the aggregation process, with the help of a single server for storing intermediate results. A large-scale scenario is examined and the possibility that data become inaccessible during the aggregation process is considered, a possibility that traditional schemes often neglect. Here, it is explicitly examined, as it might be provoked by intermittent network connec…

020203 distributed computingInformation privacyDistributed databasesDistributed databaseSettore ING-INF/03 - TelecomunicazioniComputer scienceReliability (computer networking)Secure Multi-Party Computation02 engineering and technologycomputer.software_genreSecret sharingData Mining; Data privacy; Distributed databases; Peer-to-peer computing; Secret sharing; Secure Multi-Party ComputationComputational Theory and MathematicsHardware and ArchitectureServerSignal Processing0202 electrical engineering electronic engineering information engineeringSecure multi-party computationData MiningData miningPeer-to-peer computingC-means data mining Privacy secret sharing secure multi-party computationSecret sharingcomputerData privacy
researchProduct

District heating networks: enhancement of the efficiency

2019

International audience; During the decades the district heating's (DH) advantages (more cost-efficient heat generation and reduced air pollution) overcompensated the additional costs of transmission and distribution of the centrally produced thermal energy to consumers. Rapid increase in the efficiency of low-power heaters, development of separated low heat density areas in cities reduce the competitiveness of the large centralized DH systems in comparison with the distributed cluster-size networks and even local heating. Reduction of transmission costs, enhancement of the network efficiency by optimization of the design of the DH networks become a critical issue. The methodology for determ…

020209 energynetwork design02 engineering and technology7. Clean energyAutomotive engineeringReduction (complexity)JEL: C - Mathematical and Quantitative Methods/C.C4 - Econometric and Statistical Methods: Special Topics/C.C4.C45 - Neural Networks and Related Topicsbenchmarking methodologies11. Sustainability0202 electrical engineering electronic engineering information engineeringdistrict heatingbusiness.industry020208 electrical & electronic engineeringdata miningBenchmarkingJEL: O - Economic Development Innovation Technological Change and Growth/O.O1 - Economic Development/O.O1.O13 - Agriculture • Natural Resources • Energy • Environment • Other Primary Products[SHS.ECO]Humanities and Social Sciences/Economics and FinanceNetwork planning and designVariable (computer science)Transmission (telecommunications)13. Climate actionHeat generationKey (cryptography)Environmental sciencebusinessJEL: C - Mathematical and Quantitative Methods/C.C2 - Single Equation Models • Single Variables/C.C2.C24 - Truncated and Censored Models • Switching Regression Models • Threshold Regression ModelsThermal energyInsights into Regional Development
researchProduct

Consistent Clustering of Elements in Large Pairwise Comparison Matrices

2018

[EN] In multi-attribute decision making the number of decision elements under consideration may be huge, especially for complex, real-world problems. Typically these elements are clustered and then the clusters organized hierarchically to reduce the number of elements to be simultaneously handled. These decomposition methodologies are intended to bring the problem within the cognitive ability of decision makers. However, such methodologies have disadvantages, and it may happen that such a priori clustering is not clear, and/or the problem has previously been addressed without any grouping action. This is the situation for the case study we address, in which a panel of experts gives opinions…

0209 industrial biotechnologyAHP0211 other engineering and technologiesAnalytic hierarchy process02 engineering and technologycomputer.software_genreWater distribution system (WDS)Pairwise comparisonMatrix (mathematics)020901 industrial engineering & automationSettore ING-IND/17 - Impianti Industriali MeccaniciDecomposition (computer science)Cluster (physics)Cluster analysisMathematics021103 operations researchApplied MathematicsManagement and operation of a WDSComputational MathematicsIdentification (information)Miller’s magic number sevenA priori and a posterioriPairwise comparisonData miningMiller's magic number sevenMATEMATICA APLICADAcomputerDecision-making
researchProduct

Advances in Practical Applications of Agents, Multi-Agent Systems, and Sustainability: The PAAMS Collection

2015

This volume presents the papers that have been accepted for the 2015 special sessions of the 13th International Conference on Practical Applications of Agents and Multi-Agent Systems, held at University of Salamanca, Spain, at 3rd-5th June, 2015: Agents Behaviours and Artificial Markets (ABAM); Agents and Mobile Devices (AM); Multi-Agent Systems and Ambient Intelligence (MASMAI); Web Mining and Recommender systems (WebMiRes); Learning, Agents and Formal Languages (LAFLang); Agent-based Modeling of Sustainable Behavior and Green Economies (AMSBGE); Emotional Software Agents (SSESA) and Intelligent Educational Systems (SSIES). The volume also includes the paper accepted for the Doctoral Conso…

0209 industrial biotechnologyAmbient intelligenceManagement scienceComputer scienceMulti-agent system02 engineering and technologyRecommender systemComputingMethodologies_ARTIFICIALINTELLIGENCEEngineering management020901 industrial engineering & automationWeb miningSoftware agentSustainability0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingMobile deviceDissemination
researchProduct

Ant Colony Optimisation-Based Classification Using Two-Dimensional Polygons

2016

The application of Ant Colony Optimization to the field of classification has mostly been limited to hybrid approaches which attempt at boosting the performance of existing classifiers (such as Decision Trees and Support Vector Machines (SVM)) — often through guided feature reductions or parameter optimizations.

0209 industrial biotechnologyBoosting (machine learning)business.industryComputer scienceAnt colony optimization algorithmsDecision treePattern recognition02 engineering and technologyAnt colonycomputer.software_genreSwarm intelligenceSupport vector machineComputingMethodologies_PATTERNRECOGNITION020901 industrial engineering & automationKernel method0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingArtificial intelligenceData miningbusinesscomputer
researchProduct