Search results for "Overfitting"

showing 10 items of 22 documents

State transition identification in multivariate time series (STIMTS) applied to rotational jump trajectories from single molecules

2018

Time resolved data from single molecule experiments often suffer from contamination with noise due to a low signal level. Identifying a proper model to describe the data thus requires an approach with sufficient model parameters without misinterpreting the noise as relevant data. Here, we report on a generalized data evaluation process to extract states with piecewise constant signal level from simultaneously recorded multivariate data, typical for multichannel single molecule experiments. The method employs the minimum description length principle to avoid overfitting the data by using an objective function, which is based on a tradeoff between fitting accuracy and model complexity. We val…

0301 basic medicinePhysicsNoise (signal processing)Monte Carlo methodGeneral Physics and AstronomyOverfittingSynthetic data03 medical and health sciencesTime resolved data030104 developmental biologyPiecewiseJumpStatistical physicsPhysical and Theoretical ChemistryMinimum description lengthThe Journal of Chemical Physics
researchProduct

Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide e…

2015

This study aims to compare binary logistic regression (BLR) and stochastic gradient treeboost (SGT) methods in assessing landslide susceptibility within the Mediterranean region for multiple-occurrence regional landslide events. A test area was selected in the north-eastern sector of Sicily (southern Italy) where thousands of debris flows and debris avalanches triggered on the first October 2009 due to an extreme storm. Exploiting the same set of predictors and the 2009 event landslide archive, BLR- and SGT-based susceptibility models have been obtained for the two catchments separately, adopting a random partition (RP) technique for validation. In addition, the models trained in one catchm…

Atmospheric ScienceSettore GEO/04 - Geografia Fisica E GeomorfologiaStormLandslideRegression analysisOverfittingForward logistic regressionLandslide susceptibilityDebris flowPrediction spatial transferabilityAltitudeMessina 2009 disasterNatural hazardEarth and Planetary Sciences (miscellaneous)Alternating decision treePhysical geographyStochastic gradient treeboostCartographySicilyGeologyWater Science and Technology
researchProduct

Bivariate nonlinear prediction to quantify the strength of complex dynamical interactions in short-term cardiovascular variability.

2005

A nonlinear prediction method for investigating the dynamic interdependence between short length time series is presented. The method is a generalization to bivariate prediction of the univariate approach based on nearest neighbor local linear approximation. Given the input and output series x and y, the relationship between a pattern of samples of x and a synchronous sample of y was approximated with a linear polynomial whose coefficients were estimated from an equation system including the nearest neighbor patterns in x and the corresponding samples in y. To avoid overfitting and waste of data, the training and testing stages of the prediction were designed through a specific out-of-sampl…

Bivariate time seriePhysics::Medical PhysicsBiomedical EngineeringBlood PressureBivariate analysisOverfittingCross-validationk-nearest neighbors algorithmCardiovascular Physiological PhenomenaHealth Information ManagementHeart RateTilt-Table TestStatisticsApplied mathematicsHumansComputer SimulationPredictabilityHeart rate variabilityMathematicsHealth InformaticBaroreflex controlSystolic arterial pressure variabilityUnivariateModels CardiovascularNonlinear predictionComputer Science Applications1707 Computer Vision and Pattern RecognitionComputer Science ApplicationsNonlinear systemComputational Theory and MathematicsNonlinear DynamicsLinear approximationMedicalbiological engineeringcomputing
researchProduct

Neural networks with non-uniform embedding and explicit validation phase to assess Granger causality

2015

A challenging problem when studying a dynamical system is to find the interdependencies among its individual components. Several algorithms have been proposed to detect directed dynamical influences between time series. Two of the most used approaches are a model-free one (transfer entropy) and a model-based one (Granger causality). Several pitfalls are related to the presence or absence of assumptions in modeling the relevant features of the data. We tried to overcome those pitfalls using a neural network approach in which a model is built without any a priori assumptions. In this sense this method can be seen as a bridge between model-free and model-based approaches. The experiments perfo…

Cognitive NeuroscienceEntropyFOS: Physical sciencesOverfittingcomputer.software_genreMachine learningGranger causalityArtificial IntelligenceMedicine and Health SciencesEntropy (information theory)Non-uniform embeddingComputer SimulationMathematicsArtificial neural networkbusiness.industryProbability and statisticsModels TheoreticalNeural Networks (Computer)ClassificationNeural networkAlgorithmCausalityPhysics - Data Analysis Statistics and ProbabilitySettore ING-INF/06 - Bioingegneria Elettronica E InformaticaGranger causalityEmbeddingA priori and a posterioriTransfer entropyNeural Networks ComputerArtificial intelligenceData miningbusinesscomputerAlgorithmsNeural networksData Analysis Statistics and Probability (physics.data-an)
researchProduct

BELM: Bayesian Extreme Learning Machine

2011

The theory of extreme learning machine (ELM) has become very popular on the last few years. ELM is a new approach for learning the parameters of the hidden layers of a multilayer neural network (as the multilayer perceptron or the radial basis function neural network). Its main advantage is the lower computational cost, which is especially relevant when dealing with many patterns defined in a high-dimensional space. This brief proposes a bayesian approach to ELM, which presents some advantages over other approaches: it allows the introduction of a priori knowledge; obtains the confidence intervals (CIs) without the need of applying methods that are computationally intensive, e.g., bootstrap…

Computer Networks and CommunicationsComputer scienceComputer Science::Neural and Evolutionary ComputationBayesian probabilityOverfittingMachine learningcomputer.software_genrePattern Recognition AutomatedReduction (complexity)Artificial IntelligenceComputer SimulationRadial basis functionExtreme learning machineArtificial neural networkbusiness.industryEstimation theoryBayes TheoremGeneral MedicineComputer Science ApplicationsMultilayer perceptronNeural Networks ComputerArtificial intelligencebusinesscomputerAlgorithmsSoftwareIEEE Transactions on Neural Networks
researchProduct

Estrategias para la elaboración de modelos estadísticos de regresión

2011

Multivariable regression models are widely used in health science research, mainly for two purposes: prediction and effect estimation. Various strategies have been recommended when building a regression model: a) use the right statistical method that matches the structure of the data; b) ensure an appropriate sample size by limiting the number of variables according to the number of events; c) prevent or correct for model overfitting; d) be aware of the problems associated with automatic variable selection procedures (such as stepwise), and e) always assess the performance of the final model in regard to calibration and discrimination measures. If resources allow, validate the prediction mo…

Estimationbusiness.industryCalibration (statistics)Sample size determinationMultivariable calculusStatisticsMedicineRegression analysisFeature selectionOverfittingCardiology and Cardiovascular MedicinebusinessRegression diagnosticRevista Española de Cardiología
researchProduct

Deep Q-Learning With Q-Matrix Transfer Learning for Novel Fire Evacuation Environment

2021

We focus on the important problem of emergency evacuation, which clearly could benefit from reinforcement learning that has been largely unaddressed. Emergency evacuation is a complex task which is difficult to solve with reinforcement learning, since an emergency situation is highly dynamic, with a lot of changing variables and complex constraints that makes it difficult to train on. In this paper, we propose the first fire evacuation environment to train reinforcement learning agents for evacuation planning. The environment is modelled as a graph capturing the building structure. It consists of realistic features like fire spread, uncertainty and bottlenecks. We have implemented the envir…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer Science - Artificial IntelligenceComputer scienceQ-learningComputingMilieux_LEGALASPECTSOFCOMPUTINGSystems and Control (eess.SY)02 engineering and technologyOverfittingMachine Learning (cs.LG)FOS: Electrical engineering electronic engineering information engineering0202 electrical engineering electronic engineering information engineeringReinforcement learningElectrical and Electronic EngineeringVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550business.industry020206 networking & telecommunicationsComputer Science ApplicationsHuman-Computer InteractionArtificial Intelligence (cs.AI)Control and Systems EngineeringShortest path problemEmergency evacuationComputer Science - Systems and Control020201 artificial intelligence & image processingArtificial intelligenceTransfer of learningbusinessSoftwareIEEE Transactions on Systems, Man, and Cybernetics: Systems
researchProduct

A comprehensive study of automatic program repair on the QuixBugs benchmark

2021

Abstract Automatic program repair papers tend to repeatedly use the same benchmarks. This poses a threat to the external validity of the findings of the program repair research community. In this paper, we perform an empirical study of automatic repair on a benchmark of bugs called QuixBugs, which has been little studied. In this paper, (1) We report on the characteristics of QuixBugs; (2) We study the effectiveness of 10 program repair tools on it; (3) We apply three patch correctness assessment techniques to comprehensively study the presence of overfitting patches in QuixBugs. Our key results are: (1) 16/40 buggy programs in QuixBugs can be repaired with at least a test suite adequate pa…

FOS: Computer and information sciencesCorrectnessComputer science02 engineering and technologyOverfittingMachine learningcomputer.software_genreMaintenance engineeringExternal validityComputer Science - Software Engineering020204 information systems0202 electrical engineering electronic engineering information engineeringTest suite[INFO]Computer Science [cs]computer.programming_languagebusiness.industry020207 software engineeringSoftware maintenancePython (programming language)Software Engineering (cs.SE)Software bugHardware and ArchitectureBenchmark (computing)Artificial intelligencebusinesscomputerSoftwareInformation Systems
researchProduct

Automated Patch Assessment for Program Repair at Scale

2021

AbstractIn this paper, we do automatic correctness assessment for patches generated by program repair systems. We consider the human-written patch as ground truth oracle and randomly generate tests based on it, a technique proposed by Shamshiri et al., called Random testing with Ground Truth (RGT) in this paper. We build a curated dataset of 638 patches for Defects4J generated by 14 state-of-the-art repair systems, we evaluate automated patch assessment on this dataset. The results of this study are novel and significant: First, we improve the state of the art performance of automatic patch assessment with RGT by 190% by improving the oracle; Second, we show that RGT is reliable enough to h…

FOS: Computer and information sciencesGround truthCorrectnessComputer sciencebusiness.industryRandom testing020207 software engineering02 engineering and technologyOverfittingMachine learningcomputer.software_genreOracleSoftware Engineering (cs.SE)External validityComputer Science - Software Engineering020204 information systems0202 electrical engineering electronic engineering information engineering[INFO]Computer Science [cs]State (computer science)Artificial intelligencebusinessScale (map)computerSoftware
researchProduct

Classification of Heart Sounds Using Convolutional Neural Network

2020

Heart sounds play an important role in the diagnosis of cardiac conditions. Due to the low signal-to-noise ratio (SNR), it is problematic and time-consuming for experts to discriminate different kinds of heart sounds. Thus, objective classification of heart sounds is essential. In this study, we combined a conventional feature engineering method with deep learning algorithms to automatically classify normal and abnormal heart sounds. First, 497 features were extracted from eight domains. Then, we fed these features into the designed convolutional neural network (CNN), in which the fully connected layers that are usually used before the classification layer were replaced with a global averag…

Feature engineeringComputer science0206 medical engineeringconvolutional neural networkneuroverkot02 engineering and technologyOverfittingConvolutional neural networklcsh:Technologylcsh:Chemistry0202 electrical engineering electronic engineering information engineeringFeature (machine learning)General Materials ScienceSensitivity (control systems)sydäntauditInstrumentationlcsh:QH301-705.5Fluid Flow and Transfer Processesbusiness.industrylcsh:TProcess Chemistry and TechnologyDeep learning020208 electrical & electronic engineeringGeneral EngineeringPattern recognitiondiagnostiikkaMatthews correlation coefficientautomatic heart sound classification020601 biomedical engineeringlcsh:QC1-999Computer Science Applicationsfeature engineeringkoneoppiminenlcsh:Biology (General)lcsh:QD1-999lcsh:TA1-2040Heart soundsArtificial intelligencetiedonlouhintabusinesslcsh:Engineering (General). Civil engineering (General)lcsh:PhysicsApplied Sciences
researchProduct