6533b851fe1ef96bd12a997f
RESEARCH PRODUCT
An A* Based Semantic Tokenizer for Increasing the Performance of Semantic Applications
Arianna PipitoneMaria Carmela CampisiRoberto Pirronesubject
Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniInformation retrievalComputer sciencebusiness.industrySemantic searchOntology (information science)computer.software_genreSemantic tokenizer ontology A* tree search UIMASet (abstract data type)Semantic similaritySearch algorithmSemantic computingSemantic analyticsArtificial intelligencebusinesscomputerWord (computer architecture)Natural language processingdescription
Semantic Applications (SAs) makes use of ontolo- gies and their performance can depend on the syntactic labels of the modeled entities; even if several approaches have been devised to formalize ontologies, no formal approaches have been devised for naming their constituents, which look as long word concatenations without any particular separation. We present a novel semantic tokenizer that finds the sub-words through an application of the A* based search algorithm; the A* functions rely on a set of linguistic criteria and on the meta-cognitive perspective of the activity of reading.
year | journal | country | edition | language |
---|---|---|---|---|
2013-09-01 | 2013 IEEE Seventh International Conference on Semantic Computing |