AUTHOR

Thomas Hassan

showing 2 related works from this author

Predictive and Evolutive Cross-Referencing for Web Textual Sources

2017

International audience; One of the main challenges in competitive intelligence is to harness large volumes of information from the web and extract the most valuable pieces. Because the information available on the web grows rapidly and is highly heterogeneous, this process becomes overwhelming for experts. To address this challenge, this paper presents a vision for a novel process that performs cross-referencing at web scale. The process uses a focused crawler and a semantic-based classifier to cross-reference textual items without expert intervention, building on Big Data and Semantic Web technologies. The system is described thoroughly, and interests of…

Competitive intelligence; Computer science; [SPI] Engineering Sciences [physics]; Big data; 02 engineering and technology; Reasoning; Focused crawler; Discovery; [INFO] Computer Science [cs]; World Wide Web; Knowledge-based systems; [INFO.INFO-NI] Computer Science [cs]/Networking and Internet Architecture [cs.NI]; 020204 information systems; 0202 electrical engineering electronic engineering information engineering; Leverage (statistics); Semantic Web; business.industry; Ontology; Work in process; Classification; Adaptive; [SPI.TRON] Engineering Sciences [physics]/Electronics; Cross-Referencing; 020201 artificial intelligence & image processing; business; Classifier (UML); Model

Semantic HMC for Big Data Analysis

2014

International audience; Analyzing Big Data can help corporations improve their efficiency. In this work we present a new vision for deriving Value from Big Data using a Semantic Hierarchical Multi-label Classification, called Semantic HMC, based on an unsupervised ontology learning process. We also propose a Semantic HMC process that uses scalable machine-learning techniques and rule-based reasoning.

FOS: Computer and information sciences; [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing; multi-classify; Computer science; Computer Science - Artificial Intelligence; Big data; [INFO.INFO-WB] Computer Science [cs]/Web; semantic technologies; 02 engineering and technology; Ontology (information science); Semantic data model; [INFO.INFO-DC] Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]; Semantic similarity; 020204 information systems; Semantic computing; 0202 electrical engineering electronic engineering information engineering; Information retrieval; Ontology learning; business.industry; Ontology-based data integration; Big-Data; Artificial Intelligence (cs.AI); machine learning; Ontology; Semantic technology; classification; 020201 artificial intelligence & image processing; business