Search results for "DATA MINING"

showing 10 items of 907 documents

Modeling of female human body shapes for apparel design based on cross mean sets

2014

This paper is concerned with a method to build prototypes of human bodies that can be used for apparel design. One of the most important issues in the apparel development process is to define a sizing system to provide a good fitting for the majority of the population. Since anthropometric measures do not present the same linear growth with size in each dimension, it is very important to find a prototype that represents as accurately as possible each class in the sizing system. In this paper we propose a method based on the concept of random compact mean set to define prototypes in apparel design. From a cloud of 3D points obtained with a 3D scanner a solid that represents the human body is…

education.field_of_studyComputer sciencePopulationGeneral EngineeringClass (philosophy)Sample (statistics)Mean setsHuman bodies prototypescomputer.software_genre3D shapeSizingComputer Science ApplicationsSet (abstract data type)Artificial IntelligenceData miningDimension (data warehouse)educationcomputerRandom compact setsSimulationRealization (probability)Expert Systems with Applications
researchProduct

Bayesian Hierarchical Models for Random Routes in Finite Populations

1996

In many practical situations involving sampling from finite populations, it is not possible (or it is prohibitely expensive) to access, or to even produce, a listing of all of the units in the population. In these situations, inferences can not be based on random samples from the population. Random routes are widely used procedures to collect data in absence of well defined sampling frames, and they usually have either been improperly analyzed as random samples, or entirely ignored as useless. We present here a Bayesian analysis of random routes that incorporates the information provided but carefully takes into account the non- randomness in the selection of the units.

education.field_of_studyComputer sciencePosterior probabilityPopulationBayesian probabilitySampling (statistics)Conditional probability distributioncomputer.software_genresymbols.namesakesymbolsData miningeducationcomputerSelection (genetic algorithm)RandomnessGibbs sampling
researchProduct

Expert system for predicting unstable angina based on Bayesian networks

2013

The use of computer-based clinical decision support (CDS) tools is growing significantly in recent years. These tools help reduce waiting lists, minimise patient risks and, at the same time, optimise the cost health resources. In this paper, we present a CDS application that predicts the probability of having unstable angina based on clinical data. Due to the characteristics of the variables (mostly binary) a Bayesian network model was chosen to support the system. Bayesian-network model was constructed using a population of 1164 patients, and subsequently was validated with a population of 103 patients. The validation results, with a negative predictive value (NPV) of 91%, demonstrate its …

education.field_of_studyUnstable anginaComputer sciencebusiness.industryPopulationGeneral EngineeringBayesian networkcomputer.software_genremedicine.diseaseClinical decision support systemExpert systemComputer Science ApplicationsArtificial IntelligencemedicineWeb applicationData miningeducationbusinesscomputerExpert Systems with Applications
researchProduct

Application of a Bayesian Spatiotemporal Surveillance Method to NYC Syndromic Data

2014

Incorporating prior knowledge (e.g., the spatial distribution of zip codes and background population effects) into a model using Bayesian methods could potentially improve outbreak detection. We adapted a previously described Bayesian model-based spatiotemporal surveillance technique to daily respiratory syndrome counts in NYC Emergency Department data in 2009, the year of the H1N1 influenza pandemic. Citywide, 56 alarms were produced across 15 zip codes, all during days of elevated respiratory visits. Future work includes evaluating our choice of baseline length, considering other alarm thresholds, and conducting a formal evaluation of the method across five syndromes in NYC.

education.field_of_studybusiness.industryBayesian probabilityH1N1 influenzaPopulationEmergency departmentISDS 2013 Conference Abstractscomputer.software_genreBayesian inferenceZip codeFormal evaluationspatiotemporal dataPandemicoutbreak detectionGeneral Earth and Planetary SciencesMedicinesyndromic surveillanceData miningbusinesseducationcomputerCartographyBayesian modelsGeneral Environmental ScienceOnline Journal of Public Health Informatics
researchProduct

Anomaly Detection in Dynamic Social Systems Using Weak Estimators

2009

Anomaly detection involves identifying observationsthat deviate from the normal behavior of a system. One ofthe ways to achieve this is by identifying the phenomena thatcharacterize “normal” observations. Subsequently, based on thecharacteristics of data learned from the “normal” observations,new observations are classified as being either “normal” or not.Most state-of-the-art approaches, especially those which belongto the family parameterized statistical schemes, work under theassumption that the underlying distributions of the observationsare stationary. That is, they assume that the distributions thatare learned during the training (or learning) phase, thoughunknown, are not time-varyin…

education.field_of_studybusiness.industryComputer sciencePopulationEstimatorMachine learningcomputer.software_genreOutlierAnomaly detectionArtificial intelligenceData miningAnomaly (physics)businesseducationcomputer2009 International Conference on Computational Science and Engineering
researchProduct

Prognostic and Functional Significant of Heat Shock Proteins (HSPs) in Breast Cancer Unveiled by Multi-Omics Approaches

2021

Simple Summary In this study, we investigated the expression pattern and prognostic significance of the heat shock proteins (HSPs) family members in breast cancer (BC) by using several bioinformatics tools and proteomics investigations. Our results demonstrated that, collectively, HSPs were deregulated in BC, acting as both oncogene and onco-suppressor genes. In particular, two different HSP-clusters were significantly associated with a poor or good prognosis. Interestingly, the HSPs deregulation impacted gene expression and miRNAs regulation that, in turn, affected important biological pathways involved in cell cycle, DNA replication, and receptors-mediated signaling. Finally, the proteomi…

endocrine systemHSPschemical and pharmacologic phenomenaBiologymedicine.disease_causeProteomicsArticleGeneral Biochemistry Genetics and Molecular Biologybreast cancerproteomicsHeat shock proteinexpressionmicroRNAmedicineHSPEpithelial–mesenchymal transitionlcsh:QH301-705.5GeneproteomicGeneral Immunology and MicrobiologyCancerhemic and immune systemsdata miningCell cyclemedicine.diseaselcsh:Biology (General)biological sciencesmiRNAsCancer researchprognosisGeneral Agricultural and Biological SciencesCarcinogenesisprognosiBiology
researchProduct

FPGA-based Acceleration of Detecting Statistical Epistasis in GWAS

2014

Abstract Genotype-by-genotype interactions (epistasis) are believed to be a significant source of unexplained genetic variation causing complex chronic diseases but have been ignored in genome-wide association studies (GWAS) due to the computational burden of analysis. In this work we show how to benefit from FPGA technology for highly parallel creation of contingency tables in a systolic chain with a subsequent statistical test. We present the implementation for the FPGA-based hardware platform RIVYERA S6-LX150 containing 128 Xilinx Spartan6-LX150 FPGAs. For performance evaluation we compare against the method iLOCi[9]. iLOCi claims to outperform other available tools in terms of accuracy.…

epistasis020203 distributed computing0303 health sciencesXeonWorkstationComputer scienceGenome-wide association study02 engineering and technologycomputer.software_genrelaw.inventioncontingency tables03 medical and health sciencesAccelerationFPGA technologylaw0202 electrical engineering electronic engineering information engineeringGeneral Earth and Planetary SciencesEpistasisGWASData miningpairwise gene-gene interactionField-programmable gate arraycomputer030304 developmental biologyGeneral Environmental ScienceProcedia Computer Science
researchProduct

The Influence of Student Abilities and High School on Student Growth: A Case Study of Chinese National College Entrance Exam

2019

Enabled by available educational data and data mining techniques, educational data analysis has become a hot topic. Current researches mainly focus on the prediction of problems and performance rather than revealing the underlying causal relationships. Based on a unique exam data, we extracted the abilities of examinee from HSEE (High School Entrance Exam) based on the knowledge of educational experts, then we measured student growth from middle school to high school in total score and subject scores. We studied the impact of high school ranking and student abilities of HSEE on student growth by multiple linear regression model, in which high school ranking is divided into 5 levels, Level 1…

evaluationGeneral Computer Scienceeducational data miningEducational dataeducationGeneral EngineeringEquity (finance)Multiple linear regression modelEducational data miningEntrance examRankingstudent abilityMathematics educationGeneral Materials ScienceActive listeninglcsh:Electrical engineering. Electronics. Nuclear engineeringElectrical and Electronic EngineeringStudent growthlcsh:TK1-9971high school ranking
researchProduct

Balanced Large Scale Knowledge Matching Using LSH Forest

2015

Evolving Knowledge Ecosystems were proposed recently to approach the Big Data challenge, following the hypothesis that knowledge evolves in a way similar to biological systems. Therefore, the inner working of the knowledge ecosystem can be spotted from natural evolution. An evolving knowledge ecosystem consists of Knowledge Organisms, which form a representation of the knowledge, and the environment in which they reside. The environment consists of contexts, which are composed of so-called knowledge tokens. These tokens are ontological fragments extracted from information tokens, in turn, which originate from the streams of information flowing into the ecosystem. In this article we investig…

evolving knowledge ecosystemsInformation retrievalComputer sciencebusiness.industryBig data02 engineering and technologyKnowledge ecosystemcomputer.software_genreLSH forestbig data020204 information systemsSchema (psychology)0202 electrical engineering electronic engineering information engineeringOntology020201 artificial intelligence & image processingData mininglocality-sensitive hashingbusinesscomputer
researchProduct

Exploiting Data Analytics and Deep Learning Systems to Support Pavement Maintenance Decisions

2021

Road networks are critical infrastructures within any region and it is imperative to maintain their conditions for safe and effective movement of goods and services. Road Management, therefore, plays a key role to ensure consistent efficient operation. However, significant resources are required to perform necessary maintenance activities to achieve and maintain high levels of service. Pavement maintenance can typically be very expensive and decisions are needed concerning planning and prioritizing interventions. Data are key towards enabling adequate maintenance planning but in many instances, there is limited available information especially in small or under-resourced urban road authorit…

feature importancepavement management systemComputer science0211 other engineering and technologiespavement maintenance decision02 engineering and technologypavement management systemslcsh:Technologylcsh:ChemistryGoods and services021105 building & construction0502 economics and business11. SustainabilitySettore ICAR/04 - Strade Ferrovie Ed AeroportiGeneral Materials Scienceroad asset databasesInstrumentationlcsh:QH301-705.5Fluid Flow and Transfer Processes050210 logistics & transportationbusiness.industryLevel of servicelcsh:TProcess Chemistry and TechnologyDeep learning05 social sciencesGeneral EngineeringPavement managementdeep learningTimelinedata mininglcsh:QC1-999Computer Science Applicationsroad asset databaseWorkflowRisk analysis (engineering)lcsh:Biology (General)lcsh:QD1-999lcsh:TA1-2040Key (cryptography)Settore ICAR/17 - DisegnoArtificial intelligencepavement maintenance decisionsbusinesslcsh:Engineering (General). Civil engineering (General)Predictive modellinglcsh:PhysicsApplied Sciences
researchProduct