RESEARCH PRODUCT
Sub-symbolic Encoding of Words
Giovanni Pilato, Andrea Maggio, Salvatore Gaglio, Giorgio Vassallo, Alessandro Puglisi
subject
Computer science, business.industry, Latent semantic analysis, WordNet, Lexical database, Semantics, computer.software_genre, Lexical set, Lexical item, Lexicography, Syntactic category, Artificial intelligence, business, computer, Natural language, Word (computer architecture), Natural language processing
description
A new methodology for the sub-symbolic semantic encoding of words is presented. The methodology uses the WordNet lexical database and an ad hoc modified Sammon algorithm to associate a vector with each word in a semantic n-space. All words are grouped according to the classification criteria of the WordNet lexicographers’ files; these groups are called lexical sets. Each word vector is composed of two parts: the first encodes the word's membership in one of these lexical sets; the second is related to the meaning of the word and distinguishes it from the other words of the same lexical set. Applying the proposed technique to all the words of WordNet would yield an interesting instrument for the sub-symbolic processing of texts. The first experimental results show the effectiveness of the proposed approach.
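The two-part vector described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the ad hoc modification of the Sammon algorithm, so the sketch uses the standard Sammon stress minimised by plain gradient descent. The one-hot encoding of the lexical-set part, the helper names (`one_hot`, `sammon_map`, `encode`), the toy dissimilarity matrix, and the choice of three words from the `noun.animal` lexicographer file are all assumptions made for illustration.

```python
import math
import random

def one_hot(index, size):
    """Lexical-set part (assumed one-hot): 1.0 at the set index, 0.0 elsewhere."""
    return [1.0 if k == index else 0.0 for k in range(size)]

def euclid(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def sammon_stress(target, points):
    """Standard Sammon stress; off-diagonal target entries must be positive."""
    n = len(points)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    scale = sum(target[i][j] for i, j in pairs)
    err = sum((target[i][j] - euclid(points[i], points[j])) ** 2 / target[i][j]
              for i, j in pairs)
    return err / scale

def sammon_map(target, dim=2, iters=200, lr=0.1, seed=0):
    """Gradient descent on the Sammon stress; returns the best layout seen."""
    rng = random.Random(seed)
    n = len(target)
    points = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(n)]
    scale = sum(target[i][j] for i in range(n) for j in range(i + 1, n))
    best, best_stress = [p[:] for p in points], sammon_stress(target, points)
    for _ in range(iters):
        grads = []
        for i in range(n):
            g = [0.0] * dim
            for j in range(n):
                if i == j:
                    continue
                d = max(euclid(points[i], points[j]), 1e-9)  # avoid division by zero
                coef = -2.0 / scale * (target[i][j] - d) / (target[i][j] * d)
                for k in range(dim):
                    g[k] += coef * (points[i][k] - points[j][k])
            grads.append(g)
        points = [[p[k] - lr * g[k] for k in range(dim)]
                  for p, g in zip(points, grads)]
        stress = sammon_stress(target, points)
        if stress < best_stress:
            best, best_stress = [p[:] for p in points], stress
    return best

def encode(words, set_index, n_sets, target, dim=2):
    """Concatenate the lexical-set part with the Sammon-mapped semantic part."""
    semantic = sammon_map(target, dim=dim)
    prefix = one_hot(set_index, n_sets)
    return {w: prefix + semantic[i] for i, w in enumerate(words)}

# Hypothetical toy data: three words assumed to share the noun.animal set
# (index 5 of 45 lexicographer files) with made-up pairwise dissimilarities.
words = ["dog", "cat", "horse"]
target = [[0.0, 1.0, 2.0],
          [1.0, 0.0, 2.0],
          [2.0, 2.0, 0.0]]
vectors = encode(words, set_index=5, n_sets=45, target=target)
```

In this sketch every word of the same lexical set shares the first 45 components, and only the last `dim` components separate the words from one another. A real application would need dissimilarities derived from WordNet rather than the hard-coded values used here.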
year | journal | country | edition | language |
---|---|---|---|---|
2003-01-01 | | | | |